WD-40-DERIVED PEPTIDES AND USES THEREOF

Field of the Invention 

The present invention relates in general to
compositions and methods of modulating the function of proteins
involved in protein-protein interactions. It relates more
specifically to modulating the function of a first protein of a
pair of interacting proteins wherein a second protein of the
pair contains a "WD-40" or "β-transducin" amino acid repeat
motif.



Background Art

Many intracellular processes are carried out or 
regulated by multi-subunit protein complexes that become active 
or repressed by the association or dissociation of individual 
polypeptide subunits. 

One such group or family of proteins is related to the
β subunit of transducin. Members of this group are all at least
somewhat homologous to the β-subunit of transducin at the amino
acid level, and contain a varying number of repeats of a
particular motif identified in β-transducin. The repeats have
been termed "β-transducin", or "WD-40" repeats (Fong, et al.).
Among the members of this protein family (Duronio, et
al.) are the Gβ subunits that couple many receptors to their
intracellular effector molecules, Gβ/γ subunits that anchor
another protein kinase (the β-adrenergic receptor kinase, βARK),
DNA binding proteins and yeast cell cycle proteins. All of
these require a transient protein-protein interaction for their
function. However, the sequences at the interface of these
proteins and their partners have not been identified.

The following are the references cited above and
throughout the specification:



U.S. Patent Documents 

Crea, R., U.S. Patent No. 4,888,286, issued December
19, 1989.

Eaton, M.A.W., et al., U.S. Patent No. 4,719,180,
issued Jan. 12, 1988.

Yoshio, T., et al., U.S. Patent No. 4,849,350, issued
July 18, 1989.

Other References 

Ausubel, F.M., et al., Current Protocols in Molecular
Biology, John Wiley and Sons, Inc., Media, PA.

Bohinski, R.C., Modern Concepts in Biochemistry,
Second Edition, Allyn and Bacon, Inc.

Dayhoff, M.O., in Atlas of Protein Sequence and
Structure (1972) Vol. 5, National Biomedical Research
Foundation, pp. 101-110, and Supplement 2 to this volume, pp. 1-10.

Duronio, R.J., et al., (1992) Proteins: Structure, 
Function, and Genetics 13:41-56. 

Escobedo, J.A., et al., Mol. Cell. Biol. 11:1125-1132
(1991).

Fong, et al., (1986) Proc Natl Acad Sci USA 83:2162-2166.

Hari, et al., Endocrinology 120:829-831 (1987).

Kleuss, C., et al., Science 259:832-834 (1993).

Makowske, O.M. and Rosen, O.M., J. Biol. Chem.
264:16155-16159 (1989).

Maniatis, T., et al., Molecular Cloning: A Laboratory
Manual, Cold Spring Harbor Laboratory (1982).

Miller, J.F., et al., Nature (London) 216:659-63
(1969).

Mochly-Rosen, D., and Koshland, D. E., Jr. J. Biol. 
Chem. 262:2291-2297 (1987). 

Mochly-Rosen, et al., Molec. Biol. Cell 1:693-706
(1990).

Mochly-Rosen, D., et al., Proc. Natl. Acad. Sci. USA
88:3997-4000 (1991).

Orr, J.W., et al., J. Biol. Chem. 267, 16155-16153
(1992).

Pitcher, J., et al., Science 257:1264-1267 (1992).

Reiner, et al., Nature 364:717-721 (1993).

Schulz, G.E. and R.H. Schirmer, Principles of Protein
Structure, Springer-Verlag.

Smith, B.L. and Mochly-Rosen, D., Biochem. Biophys.
Res. Commun. 188:1235-1240 (1992).

Smith, D.B., et al., Gene 67:31 (1988).

Stith, B.J. and J.L. Maller, Exp. Cell Res. 169:514-523
(1987).

Wolf, M. and N. Sahyoun, J. Biol. Chem. 261:13327-13332
(1986).

Disclosure of the Invention 

The invention includes, in one aspect, a polypeptide 
composition effective to alter the activity of a first protein, 
such as protein kinase C, or β-adrenergic receptor kinase
(βARK). The polypeptide blocks or inhibits an interaction, such
as a binding interaction, between the first protein and a second
protein containing a WD-40 region.

The polypeptide contains between 4 and 50 amino acids 
whose sequence is the same as a sequence of the same length in 
the WD-40 region of the second protein. 
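
The length and identity requirement stated above can be illustrated with a short sketch. It is only an illustration under stated assumptions: the function names and the placeholder region string are hypothetical and are not sequences from this specification. The sketch checks whether a peptide is between 4 and 50 residues long and occurs as a contiguous stretch of a given WD-40 region, and lists every candidate window of a chosen length.

```python
from typing import List

MIN_LEN = 4   # lower length bound stated in the text
MAX_LEN = 50  # upper length bound stated in the text


def is_derived_peptide(peptide: str, wd40_region: str) -> bool:
    """Return True if the peptide is 4 to 50 residues long and its sequence
    is the same as a contiguous stretch of equal length in the region."""
    peptide = peptide.upper()
    wd40_region = wd40_region.upper()
    return MIN_LEN <= len(peptide) <= MAX_LEN and peptide in wd40_region


def candidate_peptides(wd40_region: str, length: int) -> List[str]:
    """List every contiguous window of the given length in the region."""
    if not MIN_LEN <= length <= MAX_LEN:
        raise ValueError("length must be between 4 and 50 residues")
    return [wd40_region[i:i + length]
            for i in range(len(wd40_region) - length + 1)]


# Hypothetical placeholder (the 20 one-letter codes in alphabetical order),
# NOT a real WD-40 region from any SEQ ID NO.
PLACEHOLDER_REGION = "ACDEFGHIKLMNPQRSTVWY"
```

With the placeholder region, is_derived_peptide("FGHI", PLACEHOLDER_REGION) is true, while a three-residue peptide such as "FGH" fails the length bound.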

The polypeptide may block the binding of the first to 
the second protein, or may be an agonist or antagonist of the 
first protein. The WD-40 region preferably has an amino acid 
sequence homologous or identical to the sequences defined by SEQ 
ID NO:76-261. 

In a second embodiment, the invention includes a 
method of altering the activity of the first protein of the type 
defined above. The method includes selecting a polypeptide 
having between 4 and 50 amino acids whose sequence is the same 
as a sequence of the same length in the WD-40 region of the 
second protein, and contacting the polypeptide with the first 
protein under conditions which allow the formation of a complex 
between the polypeptide and the first protein, where this
interaction alters the activity of the first protein.

In one embodiment, the contacting is effective to 
inhibit the interaction between the first and second proteins. 
In another embodiment, the contacting is effective to stimulate 
the activity of the first protein. 

In still another embodiment, the contacting is 
effective to inhibit the activity of the first protein. 
The polypeptide preferably has an amino acid sequence
homologous or identical to the sequences defined by SEQ ID
NO:76-261.

In a more specific aspect of the invention, the
invention includes a polypeptide composition effective to alter
the activity of protein kinase C, where the protein kinase C
interacts with a second protein, and the second protein contains
at least one WD-40 region. The polypeptide has between 4 and 50
amino acids whose sequence is the same as a sequence of the same
length in the WD-40 region of the second protein.

In a preferred embodiment, the second protein is a 
receptor for activated protein kinase C, and has the sequence 
represented by SEQ ID NO: 27. 

In other specific embodiments, the polypeptide is (i)
an agonist of protein kinase C, and the polypeptide has the
sequence represented by SEQ ID NO:7; (ii) an antagonist of the
activity of protein kinase C; and/or (iii) an inhibitor of the
interaction between protein kinase C and the second protein. In
the latter embodiment, the polypeptide has a sequence
corresponding to SEQ ID NO:4 or SEQ ID NO:7.

The WD-40 region preferably has an amino acid sequence 
homologous or identical to SEQ ID NO: 69-75. 

In a related embodiment, the invention includes a
method of altering the activity of a protein kinase C that
interacts with a second protein, where said second protein
contains at least one WD-40 region.

The method includes selecting a polypeptide having 
between 4 and 50 amino acids whose sequence is the same as a 
sequence of the same length in the WD-40 region of the second 
protein, and contacting the polypeptide with the protein kinase 
C under conditions which allow the formation of a complex
between the polypeptide and the protein kinase C, where said
interaction alters the activity of said protein kinase C.

Other aspects of the invention include the polypeptide
compositions of the invention wherein said polypeptide is
coupled to a solid support, as well as a method to bind
selectively said first protein which method comprises contacting
a sample putatively containing said first protein with the
polypeptide composition bound to solid support and removing any
unbound components of the sample from said composition.

In still another aspect, the invention relates to a
method to assess the interaction of a first protein with a
polypeptide represented by an amino acid sequence contained in a
second protein, wherein said second protein contains at least
one WD-40 region, which method comprises contacting a sample
containing said first protein with a polypeptide composition
wherein the polypeptide has between 4 and 50 amino acids whose
sequence is the same as the sequence of the same length in the
WD-40 region of the second protein, and observing any
interaction of the first protein with said polypeptide
composition. The invention also concerns a method to assess the
ability of a candidate compound to bind a first protein which
method comprises contacting said first protein with a
polypeptide composition which binds said first protein, wherein
the polypeptide of said composition has between 4 and 50 amino
acids whose sequence is the same as a sequence of the same
length in a WD-40 region of a second protein which interacts
with said first protein, in the presence and absence of said
candidate compound; and measuring the binding of said
polypeptide in the presence and in the absence of said
candidate, wherein decreased binding of the polypeptide in the
presence as opposed to the absence of said candidate indicates
that said candidate binds to said first protein.
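
The readout of the candidate-compound method just described can be sketched as a simple comparison of two binding measurements. This is a minimal illustration, not a protocol from the specification: the function names, the arbitrary signal units, and the optional inhibition cutoff are all assumptions introduced here.

```python
def percent_inhibition(binding_without: float, binding_with: float) -> float:
    """Percent decrease in polypeptide binding caused by the candidate.

    binding_without: signal measured in the absence of the candidate.
    binding_with:    signal measured in the presence of the candidate.
    Units are arbitrary but must be the same for both measurements.
    """
    if binding_without <= 0:
        raise ValueError("binding in the absence of candidate must be positive")
    return 100.0 * (binding_without - binding_with) / binding_without


def candidate_binds(binding_without: float, binding_with: float,
                    min_inhibition: float = 0.0) -> bool:
    """Decreased binding in the presence of the candidate indicates that the
    candidate binds the first protein. min_inhibition is a hypothetical
    cutoff (in percent) that a user might raise to allow for assay noise."""
    return percent_inhibition(binding_without, binding_with) > min_inhibition
```

In practice a nonzero cutoff and replicate measurements would likely be needed before drawing a conclusion; the default of zero simply mirrors the wording "decreased binding".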

In still another aspect, the invention is directed to 
recombinant materials for the production of the polypeptides of 
the invention and methods for their production. 

These and other objects and features of the invention 
will become more fully apparent when the following detailed 
description of the invention is read in conjunction with the 
accompanying drawings.

Brief Description of the Figures

Figure 1A shows the cDNA sequence of rat brain RACK1.

Figure 1B shows an amino acid self-homology matrix
analysis of RACK1.

Figure 1C shows the amino acid sequence of RACK1,
aligned to show the seven WD-40 repeats represented in the
molecule.

Figure 2 shows the results of an overlay assay to
detect PKC binding to immobilized RACK1 in the presence and
absence of PKC activators.

Figure 3 shows the results of an overlay assay to
detect PKC binding to immobilized RACK1 in the presence and
absence of WD-40-derived peptides.

Figure 4 shows the results of an overlay assay to
detect binding of βPKC to either peptide I (SEQ ID NO:1) or
peptide rVI (SEQ ID NO:7) immobilized on nitrocellulose
membranes under various conditions.

Figure 5A shows the effects of injecting peptides I
(SEQ ID NO:1) and rVI (SEQ ID NO:7) on PKC-mediated germinal
vesicle breakdown (GVBD), a measure of insulin-induced oocyte
maturation.

Figure 5B shows the effects of injecting peptides I
(SEQ ID NO:1) and rVI (SEQ ID NO:7) on PKC-mediated germinal
vesicle breakdown (GVBD) in the absence of insulin induction.

Figure 5C shows the effects of injecting peptide rIII
(SEQ ID NO:4) on PKC-mediated germinal vesicle breakdown (GVBD)
in the absence of insulin induction.

Figure 6 shows the distribution of βPKC in Xenopus
oocytes between the cytosolic and membrane-associated fractions
following microinjection of either injection solution, peptide I
(SEQ ID NO:1) or peptide rVI (SEQ ID NO:7) with or without
insulin stimulation.

Figure 7 shows the effects of peptides I and rVI on
the sensitivity of βPKC to Arg-C endopeptidase.

Figure 8 shows the effects of peptides I and rVI on
PKC autophosphorylation in the absence of PKC activators.

Figure 9 shows the effects of peptides I and rVI on
PKC phosphorylation of histones in the absence of PKC
activators.

Figure 10 shows the effects of peptide rIII on PKC
phosphorylation of histones in the absence of PKC activators.
Figure 11 shows the amino acid sequence of the 56 kDa
human protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 12 shows the amino acid sequence of the AAC-rich
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 13 shows the amino acid sequence of the B-TRCP
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 14 shows the amino acid sequence of the
Beta-prime-COP protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 15 shows the amino acid sequence of the CDC4
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 16 shows the amino acid sequence of the Chlam-3
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 17 shows the amino acid sequence of the COP-1
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 18 shows the amino acid sequence of the CORO
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 19 shows the amino acid sequence of the Coronin
p55 protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 20 shows the amino acid sequence of the Cstf 50
kDa protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 21 shows the amino acid sequence of the bovine
G-beta-1 protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 22 shows the amino acid sequence of the bovine
G-beta-2 protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.
Figure 23 shows the amino acid sequence of the
drosophila G-beta protein with the WD-40 repeats aligned and
putative binding peptide regions delineated by a box.

Figure 24 shows the amino acid sequence of the human
G-beta-1 protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 25 shows the amino acid sequence of the human
G-beta-2 protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 26 shows the amino acid sequence of the mouse
G-beta protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 27 shows the amino acid sequence of the
drosophila groucho protein with the WD-40 repeats aligned and
putative binding peptide regions delineated by a box.

Figure 28 shows the amino acid sequence of the squid
GTP-binding protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 29 shows the amino acid sequence of the HSIEF
930 protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 30 shows the amino acid sequence of the human
12.3 protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 31 shows the amino acid sequence of the human
IEF-7442 protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 32 shows the amino acid sequence of the
insulin-like growth factor binding protein complex with the
WD-40 repeats aligned and putative binding peptide regions
delineated by a box.

Figure 33 shows the amino acid sequence of the rat
insulin-like growth factor binding protein with the WD-40
repeats aligned and putative binding peptide regions delineated
by a box.

Figure 34 shows the amino acid sequence of the human
LIS1 protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.
Figure 35 shows the amino acid sequence of the MD6
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 36 shows the amino acid sequence of the yeast
MSI1 protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 37 shows the amino acid sequence of the mouse
pc326 MUS protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 38 shows the amino acid sequence of the ORD RBI
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 39 shows the amino acid sequence of the
periodic trp protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 40 shows the amino acid sequence of the FLAP
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 41 shows the amino acid sequence of the
retinoblastoma binding protein with the WD-40 repeats aligned
and putative binding peptide regions delineated by a box.

Figure 42 shows the amino acid sequence of the S253
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 43 shows the amino acid sequence of the SOF1
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 44 shows the amino acid sequence of the STE4
yeast protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 45 shows the amino acid sequence of the TFl
transcription factor protein with the WD-40 repeats aligned and
putative binding peptide regions delineated by a box.

Figure 46 shows the amino acid sequence of the TUP1
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.



WO 95/21252

PCT/US95/01210

Figure 47 shows the amino acid sequence of the TUP1
homolog protein with the WD-40 repeats aligned and putative
binding peptide regions delineated by a box.

Figure 48 shows the amino acid sequence of the YCU?
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 49 shows the amino acid sequence of the YCW2
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 50 shows the amino acid sequence of the YKL25
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Figure 51 shows the amino acid sequence of the YRB140
protein with the WD-40 repeats aligned and putative binding
peptide regions delineated by a box.

Detailed Description of the Invention
I. Definitions

Unless otherwise indicated, all terms used herein have
the same meaning as they would to one skilled in the art of the
present invention. Practitioners are particularly directed to
Current Protocols in Molecular Biology (Ausubel) for definitions
and terms of the art.

Abbreviations for amino acid residues are the standard
3-letter and/or 1-letter codes used in the art to refer to one
of the 20 common L-amino acids. Likewise, abbreviations for
nucleic acids are the standard codes used in the art.

An "amino acid group" refers to a group of amino acids 
where the group is based on common properties, such as 
hydrophobicity, charge, or size.

A "conserved set" of amino acids refers to a
contiguous sequence of amino acids that is conserved between
members of a group of proteins. A conserved set may be anywhere
from two to over 50 amino acid residues in length. Typically, a
conserved set is between two and ten contiguous residues in
length. The individual positions within a conserved set each
typically comprise one of several amino acids, selected from an
amino acid group(s). In cases where a residue is 100% conserved
at a particular position, the conserved set sequence will
contain only that residue at that position. For example, for
the two peptides WRTAA (SEQ ID NO:263) and WRTAV (SEQ ID
NO:264), there are 4 identical positions (WRTA; SEQ ID NO:265)
and one position where the residue is an "A" or a "V".
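
The worked example of the two peptides WRTAA and WRTAV can be reproduced with a brief sketch. It assumes the peptides are already aligned and of equal length; the function names and the bracket notation for variable positions are illustrative choices made here, not conventions of the specification.

```python
from typing import List


def conserved_set(peptides: List[str]) -> List[List[str]]:
    """For equal-length, pre-aligned peptides, return the distinct residues
    observed at each position, in order of first appearance."""
    if len({len(p) for p in peptides}) != 1:
        raise ValueError("peptides must be non-empty and of equal length")
    columns = []
    for residues in zip(*peptides):
        seen: List[str] = []
        for residue in residues:
            if residue not in seen:
                seen.append(residue)
        columns.append(seen)
    return columns


def format_conserved_set(columns: List[List[str]]) -> str:
    """Write fully conserved positions as a single letter and variable
    positions as a bracketed list, e.g. [A/V]."""
    return "".join(col[0] if len(col) == 1 else "[" + "/".join(col) + "]"
                   for col in columns)


columns = conserved_set(["WRTAA", "WRTAV"])
print(format_conserved_set(columns))  # prints WRTA[A/V]
```

The first four positions are fully conserved and the fifth is either A or V, matching the example in the text.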

Proteins are typically long chains of amino acid based 
polyamides (polypeptides) capable of creating secondary and 
tertiary structure. Proteins may be composed of one, two or 
more polypeptide chains and may further contain some other type 
of substance in association with the polypeptide chain(s), such
as metal ions or carbohydrates. The size of proteins covers a
rather wide range from ~5,000 to several hundred thousand
g/mole. The 5,000 figure corresponds to the presence of roughly
40-45 amino acids.
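
The correspondence between 5,000 g/mole and roughly 40-45 residues follows from dividing by an average residue mass. The sketch below assumes a commonly quoted average of about 110 g/mole per residue; that figure, the alternative of 125 g/mole, and the function name are assumptions introduced here rather than values given in the specification.

```python
AVERAGE_RESIDUE_MASS = 110.0  # g/mole per residue; assumed textbook average


def approx_residue_count(molar_mass: float,
                         residue_mass: float = AVERAGE_RESIDUE_MASS) -> float:
    """Estimate the number of amino acid residues in a polypeptide from its
    molar mass, given an assumed average mass per residue."""
    if residue_mass <= 0:
        raise ValueError("residue_mass must be positive")
    return molar_mass / residue_mass


# About 45 residues at 110 g/mole per residue; 40 residues at 125 g/mole.
upper = approx_residue_count(5000.0)
lower = approx_residue_count(5000.0, residue_mass=125.0)
```

The stated range of 40-45 residues is therefore consistent with an average residue mass somewhere between about 110 and 125 g/mole.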

Unless otherwise indicated, the sequence for proteins 
and peptides is given in the order from the amino terminus to 
the carboxyl terminus. Similarly, the sequence for nucleic 
acids is given in the order from the 5' end to the 3' end. 

The term "interacting proteins" refers to a pair of 
polypeptides that can form a stably-associated complex due to,
for example, electrostatic, hydrophobic, ionic and/or
hydrogen-bond interactions under physiological conditions.

Proteins smaller than about 5,000 g/mole are typically 
referred to as polypeptides or simply peptides (Bohinski) . 

Two amino acid sequences or two nucleotide sequences 
are considered homologous (as this term is preferably used in 
this specification) if they have an alignment score of >5 (in 
standard deviation units) using the program ALIGN with the 
mutation gap matrix and a gap penalty of 6 or greater (Dayhoff).
The two sequences (or parts thereof) are more preferably
homologous if their amino acids are greater than or equal to
30%, more preferably 70%, still more preferably 80%, identical
when optimally aligned using the ALIGN program mentioned above.
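
The percent-identity criterion can be illustrated with a short sketch. It does not reproduce the ALIGN program or its scoring: it assumes the two sequences have already been optimally aligned (with "-" marking gaps) and simply counts identical positions. Excluding gapped columns from the denominator is a choice made here for illustration, and the function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of identical residues between two pre-aligned sequences.

    Columns containing a gap in either sequence are skipped, so the
    denominator is the number of residue-to-residue columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float) -> bool:
    """True if the aligned sequences are at least `threshold` percent
    identical; the caller supplies the threshold named in the text."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For instance, the peptides WRTAA and WRTAV share four of five positions, giving 80 percent identity.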

A peptide or peptide fragment is "derived from" a 
parent peptide or polypeptide if it has an amino acid sequence 
that is identical or homologous to the amino acid sequence of 
the parent peptide or polypeptide. Homologous peptides are 
defined above. Exemplary derived peptides are peptide rIII (SEQ
ID NO:4) and peptide rVI (SEQ ID NO:7), which are derived from
the third and seventh WD-40 repeats of RACK1 (SEQ ID NO:27),
respectively.

The term "expression vector" refers to vectors that
have the ability to incorporate and express heterologous DNA
fragments in a foreign cell. Many prokaryotic and eukaryotic
expression vectors are commercially available. Selection of
appropriate expression vectors is within the knowledge of those
having skill in the art.
The term "PKC" refers to protein kinase C, or C-kinase.

The term "RACK" refers to receptor for activated C-kinase.

The term "PS" refers to phosphatidylserine.

The term "DG" refers to diacylglycerol.

The term "PL" refers to phospholipids. Phospholipids
include both phosphatidylserine and diacylglycerol.

The term "GVBD" refers to germinal vesicle breakdown,
a measure of insulin-induced maturation in Xenopus oocytes.

The term "PCR" refers to polymerase chain reaction.

The term "NMR" refers to nuclear magnetic resonance.

The term "βARK" refers to β-adrenergic receptor
kinase.

II. General Overview of Invention.

The invention relates to interacting proteins, at 
least one of which contains an amino acid sequence with one or 
more of the characteristic repeats termed WD-40 (Fong, et al.). 

According to one aspect of the invention, the function 
of a first protein of a pair of interacting proteins may be 
modulated, altered or disrupted by the addition, to a solution
or medium containing the protein, of a peptide having a sequence
that is identical or homologous to a part of the sequence of a
WD-40 motif-containing repeat present in a second protein of the
pair of interacting proteins. 

The modulation or disruption of function of the first 
protein is due to the binding or association of the WD-40- 
derived peptide, termed "binding peptide", with the first 
protein. The consequences of the binding or association of the
binding peptide with the first protein depend on the sequence of
the peptide.

Typically, the presence of the binding peptide will
inhibit the binding of the first protein to the second protein.
This binding may be assayed in vitro by, for example, an overlay 
assay, whereby the degree of binding of one protein to another 
may be assessed. Several adaptations of overlay assays applied 
to embodiments of the present invention are described herein. 

Regardless of whether or not the WD-40-derived peptide 
affects the association of the first protein with the second 
protein, the peptide may alter or modulate defined activities of 
the first protein. These activities may be assayed by a variety 
of methods in vivo and/or in vitro. The method(s) employed
depend on the protein whose activity is being measured.

An exemplary first protein of a pair of interacting
proteins is protein kinase C (PKC). Upon activation, PKC
interacts with receptors for activated C kinase (RACKs), at
least one of which (RACK1) contains WD-40 repeats. Several
assays for determining the activity of PKC in the presence and
in the absence of peptides derived from the WD-40 region of
RACK1 are detailed herein.

Certain "interacting proteins" interact only after one
or more of them has been stimulated by an exogenous or
endogenous factor(s). For instance, PKC, as shown herein, does
not bind to RACK proteins until it has been activated by, for
example, phosphatidylserine (PS), diacylglycerol (DG) and
calcium. However, peptides derived from WD-40 repeats of a
second protein of such a pair may be able to associate with or
bind to the first protein even in the absence of activators of
the first protein, and in so doing, affect the function of the
first protein (e.g., activate, inactivate, potentiate, sensitize,
desensitize, alter the specificity, etc.).

Binding peptides derived from WD-40 repeats of a
second protein of a pair of interacting proteins may be useful
as specific agonists, antagonists, potentiators of function, and
the like, of the first protein of the pair. These properties
may make the peptides useful in a number of applications, for
example, direct use in therapeutic applications or as lead
compounds for the development of other therapeutic agents, e.g.,
small organic molecules.

III. Advantages of the Invention for the Inhibition of Activated
PKC Binding to RACK1.

Protein kinase C (PKC) is a family of at least 10
isozymes that share common structures and biochemical
characteristics. It has been demonstrated that several isozymes
are present within a single cell type, and it has been assumed
that individual PKC isozymes are involved in different cellular
functions. However, so far, the available activators and
inhibitors of PKC do not appear to be isozyme-specific.
Therefore, it is currently impossible to determine the role of
individual PKC isozymes in normal cellular functions as well as
in disease.

PKC activation by, for example, diacylglycerol and
calcium, induces the translocation of PKC from a soluble
(cytosolic) to a cell particulate (membrane-associated)
fraction, as shown in experiments herein (Example 8). Activated
PKC is stabilized in the cell particulate fraction by binding to
membrane-associated receptors (receptors for activated C-kinase,
or RACKs).

In experiments done in support of the present
invention and described herein, a clone (pRACK1) encoding a RACK
has been isolated (Example 1). RACK1 belongs to a growing
family of proteins that are homologous to the β-subunit of
transducin and contain the WD-40 motif (Fong, et al.). It was
demonstrated that peptide I (SEQ ID NO:1) binds to purified PKC
(see Example 6 and Fig. 4), inhibits the binding of PKC to
purified recombinant RACK1 protein (see Example 4 and Fig. 3),
and inhibits PKC activity in several in vivo and in vitro assays
(see Examples 7-11 and Figs. 5-9).

Peptide I (SEQ ID NO:1) is homologous to a sequence
identified in the sixth WD-40 repeat of RACK1 (see Fig. 1C). A
synthetic peptide was prepared based on this sequence (peptide
rVI; SEQ ID NO:7; underlined amino acids in repeat VI of Fig.
1C). Six more peptides were also prepared based on the
corresponding regions in repeats I-V and VII (peptides rI-rV,
rVII; SEQ ID NO:2-6, 8; underlined regions in corresponding
repeats, Fig. 1C). Some of the peptides were also found to
inhibit the binding of PKC to RACK1 (see Example 4 and Fig. 3).
In addition, some of the peptides were found to bind to purified
PKC (see Example 6, Fig. 4), partially activate PKC in the
absence of other activators (peptide rVI; see Examples 7, 10, 11
and Figs. 5, 8 and 9), and potentiate the effects of known PKC
activators on the enzyme (see Examples 7-9 and Figs. 5-7).

In Xenopus oocyte maturation studies (see, for
instance, Example 7), peptide rVI (SEQ ID NO:7) is an agonist of
βPKC. Peptide rIII, while less potent, is also an agonist of
PKC; it enhances insulin-induced oocyte maturation at 50 and
500 µM.

In cardiac myocytes, norepinephrine (NE, 2 µM) causes
translocation of δ and εPKC isozymes from the cytosolic to the
particulate fraction. Introduction into cardiac myocytes of
peptide rIII, and to a lesser extent peptide rVI, caused an
immediate translocation of δ and εPKC isozymes in the absence of
hormone stimulation. This peptide-induced translocation was
followed by degradation of δ and εPKC isozymes. Moreover,
NE-induced translocation is further enhanced in cells containing
peptide rIII.

In contrast, introduction of peptide I to these cells does
not affect PKC distribution in the absence of hormone
stimulation, nor does it induce PKC degradation. Furthermore,
NE-induced translocation is inhibited by peptide I. Similar
concentrations of a number of control peptides did not affect
PKC distribution or degradation in control or NE-treated cells.

In studies on rat cardiac myocytes, peptide rIII
induced δPKC and εPKC activation that was followed by
degradation of these activated isozymes.

Peptide rVI also augments hormone-induced
translocation of PKC isozymes (see, for example, Example 8 and
Fig. 6). In contrast, peptide I (SEQ ID NO:1) inhibited
hormone-induced translocation of PKC isozymes (Example 8, Fig. 6)
and did not cause degradation.
The data summarized above demonstrate that peptides
derived from WD-40 repeats of RACK1 can serve as PKC agonists
and antagonists in vivo, and suggest that peptides derived from
WD-40 regions of RACK1 contain at least part of the
protein-protein interface between PKC and RACK1.

Furthermore, the results suggest that (i) WD-40
repeats present in other proteins, such as Gβ subunit, may also
be located at or near a surface involved in protein-protein
interactions, (ii) peptides derived from these repeats may be
effective in disrupting the interactions of the proteins with
their partners (e.g., β-adrenergic receptor kinase (βARK)), (iii)
the peptides may modulate or alter the activity of the proteins
with which the WD-40 repeat-containing proteins interact, and
(iv) the peptides may therefore have specific biological effects
when administered in vivo.

IV. Identification of Pairs of Interacting Proteins. 
A. Biochemical Approaches. 

Novel interacting proteins may be identified and
isolated by a number of methods known to those skilled in the
art. For example, monoclonal antibodies raised to a mixture of
antigens, such as a particular tissue homogenate, may be
characterized and used to immunoprecipitate a single class of
antigen molecules present in that tissue. The precipitated
proteins may then be characterized further, and used to
co-precipitate other proteins with which they normally interact
(Hari, et al., Escobedo, et al.).

An alternate method to identify unknown polypeptides
that interact with a known, isolated protein is by the use of,
for example, an overlay assay (Wolf, et al., Mochly-Rosen, et
al., 1991). A mixture (such as a fraction of a tissue
homogenate, for example, a Triton-insoluble protein fraction)
potentially containing proteins that bind to a known, isolated
protein can be resolved using PAGE, blotted onto a
nitrocellulose or nylon membrane, and contacted with a solution
containing the known protein and any necessary co-factors or
small molecules. After washing, the membrane can be contacted
with a probe for the known protein, for example an antibody or a
mixture of antibodies, and the signal visualized.

B. Molecular Approaches. 

Putative binding proteins of a known protein may be
isolated from tissue homogenates, as described above.
Alternatively, DNA clones encoding putative binding proteins may
be identified by screening, for example, an appropriate cDNA
expression library. Expression libraries made from a wide
variety of tissues are commercially available (for example, from
Clontech, Palo Alto, CA). Expression libraries may also be
made de novo from organisms and tissues of choice by
practitioners skilled in the art.

The screening of expression libraries for clones 
expressing a protein or protein fragment of interest may be 
readily accomplished using techniques known in the art, for 
example, an overlay assay. 

An overlay-assay screening method may be used to
identify clones expressing a (known or unknown) protein or
protein fragment that binds to a probe in hand. The probe may
be a protein postulated to be involved in protein-protein
interactions with a protein expected to be present in a cDNA
library selected for screening (as was the case for the cloning
of RACK1, detailed in Example 1).

Actual screening of a selected cDNA library may be
accomplished by inducing plated clones to express cloned
exogenous sequences, transferring replicas of the induced
plaques or colonies to filter membranes, and screening the
membranes with an appropriate probe. According to this method,
lifts of filters (for example, nylon or nitrocellulose) from
appropriately-induced cDNA library plates (induced by, for
example, IPTG) are washed, blocked, and incubated with a
selected probe for a period of time sufficient to allow the
selected probe(s) to bind specifically to polypeptide fragments
present on the filters. The filters may then be washed and
reacted with a reagent (for example, antibodies such as alkaline
phosphatase-conjugated goat anti-rabbit or anti-mouse
antibodies, available from Boehringer Mannheim Biochemicals,
Indianapolis, IN). Additional reactions may be carried out as
required to detect the presence of bound probe.

One such overlay assay, described in Example 1, was
used to screen a rat brain cDNA expression library for proteins
that bind purified PKC in the presence of PKC activators
(phosphatidylserine, diacylglycerol and calcium). The filters
were screened with a mixture of rat brain PKC isozymes (α, β,
δ, ε and ζ). Following a series of washes, bound PKC isozymes
were detected with a mixture of anti-α, β, γ PKC mouse
monoclonal antibodies, and anti-δ, ε and ζ PKC rabbit polyclonal
antibodies. Bound antibodies were detected using alkaline
phosphatase-conjugated goat anti-rabbit or anti-mouse antibodies
and 5-bromo-4-chloro-3-indolyl phosphate p-toluidine salt as a
substrate.

Once a clone is identified in a screen such as the one
described above, it can be isolated or plaque purified and
sequenced. The insert may then be used in other cloning
reactions, for example, cloning into an expression vector that
enables efficient production of recombinant fusion protein.
Examples of appropriate expression vectors are pGEX (Smith, et
al., 1988) and pMAL-c2 (New England BioLabs, Beverly, MA). An
expression vector containing an insert of interest may be used
to transform appropriate host cells, such as E. coli, and the
transformed host cells can be used to produce the recombinant
protein in large amounts.

Typically, a recombinant protein is expressed in
tandem with a bacterial or viral gene product (endogenous
polypeptide) as part of a fusion protein. The junction between
the endogenous polypeptide and the recombinant protein typically
includes a recognition site for a rare-cutting protease. The
endogenous peptide may be designed to incorporate a unique
affinity tag (a short peptide sequence) to facilitate the
purification of the fusion protein with an affinity reagent,
such as an antibody directed against the affinity tag. The
recombinant protein may then be purified from the fusion protein
using the appropriate protease.

Purified recombinant protein may be used in a number
of ways, including in an overlay binding assay to screen for
peptides or substances that inhibit binding between the
recombinant protein and an interacting protein.

An example of the use of a cDNA clone to express
protein is detailed in Example 2. RACK1 cDNA, isolated as
described above and in Example 1, was subcloned into an
expression vector (pMAL-c2, New England BioLabs, Beverly, MA)
capable of expressing a cloned insert in tandem with
maltose-binding protein (MBP). The vector containing the RACK1 insert
was used to transform TB1 E. coli, which were then induced with
IPTG. The cells produced a 78 kDa fusion protein comprising
RACK1 fused to the MBP. The overexpressed fusion protein was
purified on an amylose affinity column according to the
manufacturer's protocol (New England BioLabs, Beverly, MA) and
incubated with protease Xa to separate the expressed insert from
the MBP. Following the incubation, a 36 kDa RACK1 protein was
obtained.

V. Identification of WD-40 Repeats. 

According to a method of the present invention,
provided that at least one of a pair of interacting proteins
contains a WD-40 motif or region, protein-protein interactions
can be disrupted and/or the activity of an interacting protein
can be altered with a peptide(s) derived from a WD-40 repeat(s)
of one of the proteins.

WD-40 repeats are typically found in a family of
proteins having at least a limited homology with the β subunit
of transducin. WD-40 repeats present in a selected member of
this family can be identified by (A) performing a self-homology
analysis on a selected protein using a homology matrix
(performed by, for example, the computer program DNA Strider
1.2, available from Christian Marck, Service de Biochemie et de
Genetique Moleculaire, Department de Biologie Cellulaire et
Moleculaire, Direction des Sciences de la Vie - CEA - FRANCE),
(B) aligning sequences comprising the repeating elements
revealed by the homology matrix analysis, and (C) identifying
conserved amino acid residues that typically serve to define a
WD-40 repeat. The steps are discussed individually, below.
A. Homology matrix analysis.

Determining whether a particular amino acid sequence
contains repeated motifs may be accomplished by a number of
methods known to those skilled in the art. They range from a
simple visual inspection of the sequence to the use of computer
programs which can identify repeated motifs. One
widely-implemented computer-assisted method is to generate a
self-homology matrix. A self-homology matrix computes the homology
of each amino acid residue in a particular sequence with every
other residue in that sequence. The homology scores are stored
in a 2-dimensional matrix.

Values higher than a selected criterion level are
flagged and displayed as points on an x-y coordinate. The x-
and y-axes correspond to consecutive amino acid positions in the
sequence.
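The matrix computation described above can be sketched in a few lines of Python. This is a minimal illustration only: the scoring rule (a count of identical residues within a sliding window, flagged when the count reaches a threshold) is an assumption, and is not asserted to be the scoring used by DNA Strider. The function name, the toy sequence and the parameter values are hypothetical.

```python
def self_homology_points(seq, window=21, stringency=6):
    """Return (x, y) positions whose windowed identity count meets the threshold.

    Assumption: 'stringency' is taken to mean the minimum number of identical
    residues in a window centred on (x, y); real programs may score differently.
    """
    n = len(seq)
    half = window // 2
    points = []
    for x in range(n):
        for y in range(n):
            matches = 0
            for k in range(-half, half + 1):
                i, j = x + k, y + k
                if 0 <= i < n and 0 <= j < n and seq[i] == seq[j]:
                    matches += 1
            if matches >= stringency:
                points.append((x, y))
    return points


# Invented toy sequence containing two similar 14-residue elements.
toy = "GHSDWVTAVKFSADGHTDWVLSVRFSPD"
pts = self_homology_points(toy, window=5, stringency=4)

# Points off the primary diagonal indicate candidate repeating elements.
off_diagonal = sorted(p for p in pts if p[0] < p[1])
print(off_diagonal)
```

Plotting the returned points on x-y axes would give the kind of display discussed in this part: a primary diagonal plus shorter parallel lines wherever an element repeats.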

An example of a self-homology matrix analysis is shown
in Figure 1B. The matrix was generated using the computer
program DNA Strider 1.2 (Christian Marck, Service de Biochemie
et de Genetique Moleculaire, Department de Biologie Cellulaire
et Moleculaire, Direction des Sciences de la Vie - CEA - FRANCE)
with the amino acid sequence of RACK1 (SEQ ID NO:27) with a
window setting of 21 and a stringency of 6. Some typical
features of a self-homology matrix are evident in the figure.
The graph shows a "primary" diagonal line extending from the
origin with a slope of unity, corresponding to the fact that the
sequence is identical to itself. If the sequence contains
repeating elements, as RACK1 does, there will be other, shorter
sets of contiguous points arranged in diagonal lines
substantially parallel to the primary diagonal and offset from
the primary diagonal in the x- or y-directions. These shorter
lines identify the locations of repeating elements within the
sequence. Each repeating element will result in two sets of
displayed points, symmetrically distributed about the primary
diagonal.

The data displayed in a homology matrix analysis can
be used to locate and roughly align the sequences of repeating
elements for a more detailed analysis. The horizontal band
delineating the region between ~100 and ~130 on the y-axis in
Fig. 1B highlights the fact that portions of that region of
RACK1, that is, the amino acids between about amino acid 100 and
amino acid 130, are repeated a total of seven times in the
sequence of RACK1. Arrows point to the repeats in the homology
matrix. For purposes of rough alignment, the short diagonal
lines pointed out by the arrows can be extended to the
horizontal line at amino acid ~100 on the y-axis, and the x-axis
location corresponding to the intersection be noted. For
example, the intersection corresponding to the second repeat
(second arrow from the left) is at x = ~50.

Values determined in this manner may then be used to
align the amino acid sequence of the repeats with each
consecutive repeat beneath the preceding one, the start of each
repeat corresponding approximately to the amino acid position
determined by the analysis in the preceding paragraph. The
amino acid sequence of RACK1, aligned in this manner, is shown
in Fig. 1C.
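The projection of short diagonal lines onto a reference row can be illustrated as follows. Because each diagonal has a slope of unity, every point on it shares the same value of x minus y, so the x-axis intersection at a reference y-position is simply that position plus the offset. The sketch below is an assumption-laden illustration: the point coordinates are invented (they are not taken from Fig. 1B), and the function name and the `min_points` noise filter are hypothetical.

```python
from collections import Counter


def repeat_starts(points, y_ref, min_points=3):
    """Estimate approximate repeat start positions on the x-axis.

    Each diagonal is identified by its offset d = x - y; a diagonal extended
    to the horizontal line y = y_ref crosses it at x = y_ref + d. Offsets
    supported by fewer than min_points flagged points are treated as noise.
    """
    counts = Counter(x - y for x, y in points)
    return sorted(y_ref + d for d, c in counts.items() if c >= min_points)


# Invented flagged points within a reference band near y = 100:
points = [(i, i) for i in range(100, 110)]        # primary diagonal
points += [(i - 50, i) for i in range(102, 107)]  # a repeat near x = 50
points += [(i + 42, i) for i in range(101, 105)]  # a repeat near x = 142
points += [(7, 120)]                              # isolated noise point

starts = repeat_starts(points, y_ref=100)
print(starts)  # [50, 100, 142]
```

The resulting positions are only rough starting points; the residues themselves are then aligned as described in part B.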

Most commercially-available DNA and protein sequence
analysis programs have the capability to perform a self-homology
matrix analysis. One example is the program DNA Strider 1.2
(Christian Marck, Service de Biochemie et de Genetique
Moleculaire, Department de Biologie Cellulaire et Moleculaire,
Direction des Sciences de la Vie - CEA - FRANCE).

Once the repeating elements are identified and the 
sequences corresponding to repeating elements are roughly 
aligned, one may proceed to define the degree of homology among 
the individual repeats at the specific positions within the 
repeats, as is described below. 
B. Aligning amino acid sequences.

If a self-homology matrix was used to obtain a crude
alignment, the sequences may be aligned by eye on a personal
computer or the like using, for example, a text editor, a
drawing program or a sequence-analysis program. Examples of
programs effective to accomplish an alignment include "MACDRAW
PRO" (Claris Corp., Santa Clara, CA) and "WORD" (Microsoft
Corp., Redmond, WA), both of which are available for "MACINTOSH"
series computers (Apple Computer Corporation, Cupertino, CA), as
well as IBM-compatible computers running "WINDOWS" (Microsoft
Corp.).

Amino acid sequences corresponding to internal repeats
can also be aligned automatically using a protein sequence
analysis program, such as "MACVECTOR" (Eastman Kodak Co., New
Haven, CT).

According to a method of the invention, aligned
sequences are examined further to determine if they fulfil
criteria to be defined as WD-40 repeats. These criteria are
detailed in part C, below.

C. Amino acid residues that define a WD-40 repeat. 

Upon completion of steps outlined in parts A and B
above, that is, determining whether a particular protein
contains internal repeats, and if so, aligning those repeats, it
is necessary to determine whether the aligned repeats contain
WD-40 regions.

A WD-40 motif is roughly defined as a contiguous
sequence of about 25 to 50 amino acids with relatively-well
conserved sets of amino acids at the two ends (amino- and
carboxyl-terminal) of the sequence. Conserved sets of at least one
WD-40 repeat of a WD-40 repeat-containing protein typically contain
conserved amino acids at certain positions. The amino-terminal
set, comprised of two contiguous amino acids, often contains a
Gly followed by a His. The carboxyl-terminal set, comprised of
six to eight contiguous amino acids, typically contains an Asp
at its first position, and a Trp followed by an Asp at its last
two positions.

A more accurate definition of a WD-40 motif
incorporates the observation that while specific residues, such
as those identified above, are not always conserved within a
WD-40 motif, conserved positions within the motif are typically
occupied by residues selected from a restricted class of amino
acids.

In order to better define the class of conserved
residues at selected positions, it is necessary to group amino
acids on the basis of certain common properties. A functional
way to define common properties between individual amino acids
is to analyze the normalized frequencies of amino acid changes
between corresponding proteins of homologous organisms (Schulz).
According to such analyses, groups of amino acids may be defined
where amino acids within a group exchange preferentially with
each other, and therefore resemble each other most in their
impact on the overall protein structure (Schulz). Examples of
amino acid groups defined in this manner, some of which are used
in the definition of a WD-40 motif herein, include:

(i) a charged group, consisting of Glu and Asp, Lys, Arg
and His,

(ii) a positively-charged group, consisting of Lys, Arg
and His,

(iii) a negatively-charged group, consisting of Glu and
Asp,

(iv) an aromatic group, consisting of Phe, Tyr and Trp,

(v) a nitrogen ring group, consisting of His and Trp,

(vi) a large aliphatic nonpolar group, consisting of Val,
Leu and Ile,

(vii) a slightly-polar group, consisting of Met and Cys,

(viii) a small-residue group, consisting of Ser, Thr, Asp,
Asn, Gly, Ala, Glu, Gln and Pro,

(ix) an aliphatic group consisting of Val, Leu, Ile, Met
and Cys, and

(x) a small hydroxyl group consisting of Ser and Thr.

In addition to the groups presented above, each amino
acid residue may form its own group, and the group formed by an
individual amino acid may be referred to simply by the one
and/or three letter abbreviation for that amino acid commonly
used in the art.
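For readers who wish to apply the groups computationally, the ten groups listed above may be written as sets of one-letter amino acid codes. The sketch below is illustrative only; the names `GROUPS` and `in_groups` are hypothetical, and the convention of lower-case Roman numerals for groups versus upper-case letters for single-residue groups is an assumption made here to keep the two from colliding.

```python
# Groups (i)-(x) as one-letter codes. Keys are lower-case Roman numerals;
# residues are upper-case, so "i" (group i) and "I" (Ile) stay distinct.
GROUPS = {
    "i": set("EDKRH"),         # charged
    "ii": set("KRH"),          # positively charged
    "iii": set("ED"),          # negatively charged
    "iv": set("FYW"),          # aromatic
    "v": set("HW"),            # nitrogen ring
    "vi": set("VLI"),          # large aliphatic nonpolar
    "vii": set("MC"),          # slightly polar
    "viii": set("STDNGAEQP"),  # small residue
    "ix": set("VLIMC"),        # aliphatic
    "x": set("ST"),            # small hydroxyl
}


def in_groups(residue, allowed):
    """True if residue belongs to any named group in 'allowed'.

    A name that is not a group key is treated as a single-residue group,
    e.g. "I" for Ile, following the text above.
    """
    return any(residue in GROUPS.get(name, {name}) for name in allowed)


print(in_groups("G", ["ii", "vi", "viii"]))  # Gly is in group viii
print(in_groups("I", ["i", "x", "I"]))       # Ile as its own group
```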

A "WD-40" motif is defined herein as a contiguous set
of amino acids between (inclusive) two sets of relatively well
conserved residues, termed herein as an "amino-terminal set" and
a "carboxyl-terminal set".

The amino-terminal set contains two adjacent amino
acids. The residue at the first position is typically selected
from groups ii, vi or viii, while the residue at the second
position is typically selected from groups i, x or Ile. The
first and second positions will often consist of Gly and His,
respectively. The Gly and His residues are typically present in
at least one of the aligned repeats of a WD-40-containing
protein.

The carboxyl-terminal conserved set typically includes
eight residues, but may contain as few as six residues. The
most well-conserved residue in WD-40 motifs identified thus far
is an Asp residue, comprising the first amino acid of the
carboxyl-terminal conserved set. It is present in virtually all
WD-40 repeats illustrated herein. In those repeats where it is
not present, the position is occupied by a residue from groups
iii or Gly.

The last two amino acids in the carboxyl-terminal
conserved set are typically selected from groups iv or Ile, and
groups i or viii, respectively. The most commonly used residue
at the first of these positions is Trp. It is typically present
in at least one of the WD-40 repeats of any given protein. The
second position is occupied less consistently by a single
residue, but is often occupied by Asp. The Trp-Asp (WD)
combination is part of the namesake of WD-40 repeats.

The amino acids present in the internal portion of the
carboxyl-terminal conserved set are less well-conserved than the
terminal residues, and their total number may differ by up to
two residues in different WD-40 repeats. The third position in
from the carboxyl-terminal end of the carboxyl-terminal
conserved set is typically selected from groups viii or ix, more
typically ix. The fifth position in from the carboxyl-terminal
end of the carboxyl-terminal conserved set is also typically
selected from groups viii or ix, more typically ix.

The length of a WD-40 repeat, including the
amino-terminal and carboxyl-terminal conserved sets, is typically
between about 25 and about 50 residues, more typically between
about 2? and 34 residues. This distribution arises primarily
from differences in the number of residues present between the
amino-terminal and carboxyl-terminal conserved sets.
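The positional criteria set out in this part can be collected into a single check, sketched below in Python. Several assumptions should be noted: the criteria described as "typical" are applied here as hard rules, which is stricter than the text intends; the first position of the carboxyl-terminal set is tested at 6, 7 and 8 residues from the end because the set may contain six to eight residues; and the function name and the example sequence are invented for illustration.

```python
# Residue groups from the list in this part, as one-letter codes.
G_I, G_II, G_III = set("EDKRH"), set("KRH"), set("ED")
G_IV, G_VI = set("FYW"), set("VLI")
G_VIII, G_IX, G_X = set("STDNGAEQP"), set("VLIMC"), set("ST")


def looks_like_wd40(repeat):
    """Apply the 'typical' WD-40 criteria as hard rules (an assumption)."""
    if not 25 <= len(repeat) <= 50:
        return False
    # Amino-terminal set: groups ii, vi or viii, then groups i, x or Ile.
    n_ok = repeat[0] in (G_II | G_VI | G_VIII) and repeat[1] in (G_I | G_X | {"I"})
    # First position of the carboxyl-terminal set: Asp, else group iii or Gly.
    c_first_ok = any(repeat[-k] in (G_III | {"G"}) for k in (6, 7, 8))
    # Last two positions: group iv or Ile, then group i or viii.
    c_last_ok = repeat[-2] in (G_IV | {"I"}) and repeat[-1] in (G_I | G_VIII)
    # Third and fifth positions in from the carboxyl-terminal end.
    c_inner_ok = all(repeat[-k] in (G_VIII | G_IX) for k in (3, 5))
    return n_ok and c_first_ok and c_last_ok and c_inner_ok


example = "GHTAAVSALAFSPDGKLLASGSADGTIKLWD"  # invented, 31 residues
print(looks_like_wd40(example))
```

A repeat that fails such a check is not thereby excluded; as discussed in this part, the repeats of a protein are better judged together than one at a time.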

The number of WD-40 repeats in a particular protein
can range from two to more than eight. The average number is
about 5.
A determination of whether or not a set of aligned
internal repeats are WD-40 repeats can be facilitated by an
examination of all of the repeats as a whole, rather than an
examination of each repeat individually. This is in part
because not all of the aligned repeats will necessarily contain
all of the conserved sequences that serve to identify WD-40
repeats, although the conserved residues will typically appear
in at least one of the repeats.
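Examining the repeats as a whole can be approximated by tallying, across all aligned repeats, how many carry each hallmark residue. The following Python sketch is illustrative: the three sequences are invented and are not the repeats of any protein described herein, and the function name `hallmark_counts` is hypothetical.

```python
def hallmark_counts(repeats):
    """Count how many aligned repeats show each hallmark residue pattern."""
    checks = {
        "GH at amino terminus": lambda r: r.startswith("GH"),
        "W at second-to-last position": lambda r: r[-2] == "W",
        "WD at carboxyl terminus": lambda r: r.endswith("WD"),
    }
    return {name: sum(check(r) for r in repeats) for name, check in checks.items()}


# Invented repeats: one fully typical, one lacking "GH", one ending in "WK".
repeats = [
    "GHTAAVSALAFSPDGKLLASGSADGTIKLWD",
    "LDSPVNCVAFHPNGQYLATGSGDKTVRLWD",
    "GHKGEVTSLSWSPDGRLLVSASDDGSIKVWK",
]
counts = hallmark_counts(repeats)
for name, n in counts.items():
    print(f"{name}: {n} of {len(repeats)} repeats")
```

In this toy set no single hallmark other than the Trp is present in every repeat, yet each hallmark appears in at least one repeat, which is the pattern the preceding paragraph describes.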

For example, Fig. 1C shows the RACK1 amino acid
sequence aligned to illustrate the internal repeats present in
the sequence. All of the repeats are WD-40 repeats, even though
the amino-terminal conserved set of repeat VI, for instance,
contains an "LD" as opposed to the more usual "GH", and the
carboxyl-terminal conserved set contains a "G" at its first
position, as opposed to the highly-conserved "D". Similarly,
the carboxyl-terminal conserved set of, for example, repeat I,
contains a "WK" at the last two positions, as opposed to the
more usual "WD".

It will be appreciated that certain residues or sets
of residues will be well-conserved in the WD-40 repeats of a
selected protein, even though they may not be conserved in WD-40
repeats in general. Such residues or sets of residues may be
useful in several ways. For example, they may be used in
performing an alignment of internal repeats in a selected
protein, as described in part B, above. The residues may also
be useful for identifying regions based on which effective
binding peptides may be designed (see section VI, below).

D. Identification of WD-40 repeats in RACK1.

In experiments done in support of the present
invention, a protein that binds to activated PKC was cloned and
sequenced (see Example 1). Sequence analysis of the deduced
amino acid sequence revealed the presence of repeats, which were
aligned and are shown in Figure 1C.

The aligned repeats were identified as WD-40 repeats
by application of the criteria identified in parts A, B and C
above. For example, the conserved amino-terminal set in repeats
I, II, III and V consists of the typical "GH", whereas in
repeats IV, VI and VII, the set consists of other residues.
These other residues, however, are contained in at least one of
the amino acid groups identified above as conserved at the
appropriate position. The conserved carboxyl-terminal set
contains the highly-conserved "D" at its first position in all
repeats except repeat VI. The second-to-last position of this
set contains the relatively-well conserved "W" in each repeat,
while the last position contains the typical "D" in repeats II,
V and VI, and other residues in the other repeats.
Taken together, these data indicate that the repeats
contained in RACK1 are WD-40 repeats. The data also illustrate
that not all repeats contain all of the elements typical of a
WD-40 motif, but that when the repeats are aligned and viewed
together as a whole, a WD-40 motif is apparent in all repeats.



E. Identification of WD-40 repeats in sequenced proteins.

Data were compiled in support of the present invention
to illustrate how WD-40 repeats in various proteins may be
identified, and to illustrate the diversity of amino acid
sequences that may be properly identified as WD-40 repeats by
those skilled in the art following the guidance set forth
herein. Two methods that were used to identify WD-40-containing
protein sequences are detailed in Example 7.

In the first method, proteins identified in their
description as having a homology to β-transducin were examined
as detailed in parts B-D, above, for WD-40 repeats. Thirty proteins
were identified in this manner. The amino acid sequences of
these proteins, with the WD-40 regions aligned and delineated,
are shown in Figs. 12-18, 20-27, 29-30, 34-35, 37-38, 40 and
42-50. The sequences are represented in the Sequence Listing as
SEQ ID NO:29-35, 37-44, 46-47, 51-52, 54-55, 57 and 59-67.

In the second method, proteins whose sequences were
homologous to a consensus WD-40 motif (SEQ ID NO:262) were
identified and examined for WD-40 repeats. Ten additional
proteins containing WD-40 repeats were identified with this
strategy. The amino acid sequences of those proteins, with the
WD-40 repeats aligned and delineated, are shown in Figs. 11, 19,
28, 31-33, 36, 39, 41 and 51. The sequences are represented in
the Sequence Listing as SEQ ID NO:28, 36, 45, 48-50, 53, 56, 58
and 68.

Other types of searches may be equally effective at
identifying proteins which may contain WD-40 repeats. For
example, on-line databases such as GenBank or SwissProt can be
searched, either with an entire sequence of a WD-40-containing
protein, or with a consensus WD-40 repeat sequence. Various
search algorithms and/or programs may be used, including FASTA,
BLAST or ENTREZ. FASTA and BLAST are available as a part of the
GCG sequence analysis package (University of Wisconsin, Madison,
Wisconsin). ENTREZ is available through the National Center for
Biotechnology Information, National Library of Medicine,
National Institutes of Health, Bethesda, MD.

Sequences identified with a protein homology search 
are then analyzed as described in parts A, B and C, above, to 
identify potential WD-40 motifs. Once located, the motifs can 
be aligned, and effective binding peptides may be designed. 
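As a rough complement to a homology search, a candidate sequence may be scanned locally for stretches resembling the most typical form of the motif. The Python sketch below uses a regular expression and rests on several assumptions: the pattern is invented for illustration and is not the consensus motif of SEQ ID NO:262; it demands the literal "GH" and "WD" hallmarks and so would miss repeats that lack them; and the length bounds only approximate the 25 to 50 residue range. The test protein is likewise invented.

```python
import re

# Hypothetical, deliberately strict pattern: Gly-His, a variable central
# region, then an Asp ... Trp-Asp carboxyl-terminal set of 6 to 8 residues.
WD40_LIKE = re.compile(r"GH.{15,42}?D.{3,5}WD")


def scan(sequence):
    """Return (start index, matched text) for each non-overlapping hit."""
    return [(m.start(), m.group()) for m in WD40_LIKE.finditer(sequence)]


# Invented protein: a short leader, one typical-looking repeat, a short tail.
protein = "MSE" + "GHTAAVSALAFSPDGKLLASGSADGTIKLWD" + "KKK"
hits = scan(protein)
print(hits)
```

Hits from such a scan are only candidates, and would still be analyzed as described in parts A, B and C.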

F. Identification of WD-40 regions in novel polypeptides.
WD-40 repeats may be identified in a novel polypeptide
by, for example, the methods described in parts A-D above. It
will be appreciated, however, that step A above (homology
matrix) is not required in the identification of WD-40 repeats.
Following the guidance of the present invention, one skilled in
the art may, for instance, identify a WD-40 motif while scanning
the sequence of some, perhaps novel, polypeptide merely through
a recognition of one or more of the features characteristic of
WD-40 repeats.

The precise methods by which one skilled in the art
arrives at the conclusion that a particular motif is a WD-40
repeat are less relevant to the present invention than is the use
of sequences derived from WD-40 motifs, regardless of how they
are identified, to design peptides effective to alter or
modulate the activity of one member of a pair of interacting
proteins and/or to disrupt protein-protein interactions.

VI. Identification of Activity-altering Peptides.



WO 95/21252                                       PCT/US95/01210



Upon the alignment and recognition of WD-40 repeats in 
a particular protein, one may proceed to design a peptide or a 
set of peptides that may be effective to associate with or bind 
to the protein with which the WD-40 -containing protein normally 
associates. Such a binding or association may be expected to 
alter or modulate the activity of the protein and/or disrupt the 
association of the pair of interacting proteins. 

The sequence of such a peptide will typically be
homologous, if not identical to, a contiguous amino acid
sequence contained within at least one of the WD-40 repeats.
Examples of the selection of WD-40-derived peptides effective to
disrupt protein-protein interactions are detailed in parts C and
D below, for RACK-PKC and Gβ/γ-βARK interactions, respectively.

A. Choosing an appropriate region within a WD-40 repeat.
Putative binding peptides may be selected from any
portion of a WD-40 repeat. If it is desired to obtain a degree
of discrimination between the various WD-40-containing proteins,
peptides should be chosen from the region between, and not
including, the amino-terminal and carboxyl-terminal conserved
sets. This "central region" typically shows greater sequence
diversity between different WD-40-containing proteins than the
terminal regions, and is roughly outlined by boxes in Figures
11-51, which show the amino acid sequences and aligned WD-40
repeats of various WD-40 repeat-containing proteins. Within the
central region, peptides should be selected from sequences that
have little or no homology to any other known sequences, save
the sequence(s) of the protein(s) targeted for disruption.

For example, peptides rIII (SEQ ID NO:4, seven amino
acids) and rVI (SEQ ID NO:7, eight amino acids) are identical
to segments of RACK1 WD-40 repeats (III and VI, respectively)
beginning five amino acids in from the amino termini of the
WD-40 repeats from which they are derived (see Fig. 1C, underlined
segments). The WD-40 repeat segments corresponding to the
binding peptides comprise the left portion of the central region
of the respective WD-40 repeats, and are not well-conserved in
RACK1.
If it is desired to inhibit the interactions of, for
example, all of the isoforms of a particular WD-40-containing
protein family, a sequence is selected that includes a
significant number of residues that are shared or highly
homologous among at least one WD-40 repeat of each of the
targeted isoforms.

If, on the other hand, an isoform-specific reagent is
desired, a sequence is selected from a WD-40 repeat(s) of a
specific isoform, where that sequence does not include a
significant number of residues that are identical or highly
homologous to residues in WD-40 sequences from related isoforms.
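The selection logic of this part may be sketched as a simple enumeration: take the central region of a repeat, list every contiguous segment of a chosen length, and discard segments that also occur in sequences from which discrimination is desired. The Python sketch below is a crude illustration under stated assumptions: exact substring matching stands in for "little or no homology", the carboxyl-terminal set is assumed to be eight residues long, and the function name and all sequences are invented.

```python
def candidate_peptides(repeat, other_sequences, length=7, c_set_len=8):
    """List central-region segments of a repeat that are absent elsewhere.

    The central region excludes the two-residue amino-terminal set and an
    assumed c_set_len-residue carboxyl-terminal set. Exact substring
    matching is only a rough proxy for a proper homology comparison.
    """
    central = repeat[2:len(repeat) - c_set_len]
    windows = [central[i:i + length] for i in range(len(central) - length + 1)]
    return [w for w in windows if not any(w in s for s in other_sequences)]


repeat = "GHTAAVSALAFSPDGKLLASGSADGTIKLWD"  # invented repeat
others = ["MKFSPDGKLQ"]                       # invented related sequence
cands = candidate_peptides(repeat, others)
print(len(cands), cands[0])
```

For an isoform-specific reagent the surviving segments would be preferred; for a reagent directed at a whole family, the opposite filter (segments shared among the targeted isoforms) would be applied.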

B. Choosing an appropriate length for a peptide.

Binding peptides may be designed that range
in length from as few as about four residues to 40 or more
residues. Preferably, binding peptides will have a length of at
[...] residues. The
length will be determined in part by the degree of desired
homology to other WD-40 repeats, as described in part A above,
and by the level of discrimination between proteins that is
required.

For example, binding peptides selected from RACK1
sequences to inhibit RACK1/PKC interactions were seven and eight
amino acids in length. The peptides are long enough to bind
specifically to the targeted sequences, but short enough to not
cross-react with other WD-40 repeat binding proteins. These
properties enable the peptides to have very selective and
specific effects, as is shown below in Examples 6-11.



C. Design of RACK1 WD-40-derived peptides.

Peptides rIII (SEQ ID NO:4, seven amino acids) and rVI
(SEQ ID NO:7, eight amino acids) were designed in part following
[...]. They are
identical to segments of RACK1 WD-40 repeat sequences beginning
five amino acids in from the amino termini of the WD-40 repeats
from which they are derived. The WD-40 repeat segments
corresponding to the binding peptides comprise the left portion
of the central region of the WD-40 repeats. The peptides were
tested for their ability to disrupt protein-protein interactions
in vitro and in vivo, as described in section VII and Examples
6-11 below.

D. Peptides derived from WD-40 repeats of Human G-Beta
inhibit interactions of G-Beta subunits with βARK.

Methods described in section V part E were used to
identify WD-40 repeats (SEQ ID NO:128-134) in Human G-Beta (SEQ
ID NO:41). Segments from the first six WD-40 repeats were
selected for the design of G-beta binding peptides (SEQ ID
NO:13-18). The segments were selected based on criteria
detailed in parts A and B, above.

The G-beta binding peptides are used to disrupt the
interactions of G-beta subunits with βARK. The disruption is
assayed using a modification of the overlay assay described in
Example 4.

VII. Testing of Putative Binding Peptides.

Detailed below are several assays by which the
efficacy of WD-40-derived peptides at binding to a target
protein, inhibiting protein-protein interactions, and altering
or modulating the activity of a target protein may be
determined.

One class of assays widely used to assess the binding
of two proteins to each other is the overlay assay. Overlay
assays are generally applicable to most proteins. They can be
used to, for example, assess the binding of WD-40-derived
peptides to their targets, as shown in Example 6 and described
in part B below. Overlay assays can also be used to assess the
ability of WD-40-derived peptides to inhibit the binding of two
interacting proteins, one of which contains a WD-40 motif from
which the peptides were derived (see, for instance, Example 4
and part C below).

Other assays may be used to assess effects of
WD-40-derived peptides on the activity of the target protein. These
assays may be in vivo assays, in vitro assays, or a combination
of in vivo and in vitro assays. The assay used will depend on
the proteins involved and on the system(s) and/or process(es)
that involve the interacting proteins against which the peptide
was targeted. For instance, the assays described in parts D-I
below are appropriate for characterizing PKC activity in vivo
and in vitro.

While many of the assays below are particularly useful
for characterizing the activity of PKC, they also illustrate a
general framework of experiments by which the effects of
WD-40-derived peptides on other proteins may be assessed.

A. Overlay assays to evaluate efficacy of putative
binding peptides derived from WD-40 regions.

An overlay assay can be used to assess the disruption
of the ability of a pair of proteins to associate. Methods for
conducting overlay assays are well-known in the art (see, for
example, Mochly-Rosen, et al., 1991).

Applications of overlay assays to evaluate putative
binding peptides for PKC/RACK1 interactions are presented in
Examples 4 and 5 herein. The assays can be generally described
as follows.

One protein of a pair of interacting proteins
("immobilized" protein) can be resolved on an SDS/PAGE gel and
blotted onto an appropriate membrane (for example,
nitrocellulose or nylon) by methods known to those skilled in
the art. The blots may then be contacted with a solution
containing the other protein of the pair of interacting proteins
("overlay" protein) in the presence and in the absence of
putative binding peptides. Following appropriate wash steps,
bound overlay protein can be detected by the use of an
appropriate probe, such as an antibody directed against the
overlay protein.

A variation on the above protocol may be performed to
minimize possible interference between unbound binding peptide
and antibodies used to detect the presence of bound overlay
protein. The modification consists of performing another
SDS/PAGE electrophoresis between the steps of binding the
overlay protein and detecting the overlay protein with antibody
or other probe. It is accomplished by cutting the blot into
pieces sized to just encompass the area occupied by the blotted
immobilized protein, after the overlay protein has been
contacted (in the presence or in the absence of binding
peptides) and allowed to bind to the blot. The pieces of
membrane are then incubated in a sample buffer, placed in the
wells of a second SDS polyacrylamide gel and subjected to
electrophoresis.

Following electrophoresis, the gel is blotted as
above, and contacted with a probe, for example antibodies, to
detect bound overlay protein.

B. Binding of βPKC to peptides homologous to a WD-40
region of RACK1.

The binding of βPKC to peptide I (SEQ ID NO:1),
peptide rVI (SEQ ID NO:7) and control peptide (SEQ ID NO:9) was
assessed in Example 6 using a PKC overlay assay similar to that
described in Example 3. Increasing amounts of peptides were
applied onto nitrocellulose using a slot-blot apparatus. The
membranes were incubated with PKC in the presence and absence of
PS, DG, and calcium.

The data are shown in Figure 4, and show that
activated PKC bound to both peptides I and rVI at peptide
amounts as low as 5 μmoles, but not to the control peptide.
Unactivated PKC did not bind to peptide I, but did bind to
peptide rVI at similar concentrations.

The results indicate that while the peptides were
homologous to one another and were capable of binding to the
same protein, they behaved differently. Peptide rVI (SEQ ID
NO:7; 8 residues) was able to bind to both activated and
unactivated forms of PKC, whereas peptide I (SEQ ID NO:1; 15
residues) could bind only to activated PKC. The differences
between the binding properties may be due, for example, to
charge differences and/or length differences between the two
peptides.
C. Effects of peptides homologous to WD-40 region of
RACK1 on PKC binding to RACK1.

Two peptides (peptide rIII; SEQ ID NO:4 and peptide
rVI; SEQ ID NO:7) identical to regions of RACK1 WD-40 repeats
(underlined, Figure 1C) were tested for their ability to inhibit
PKC binding to recombinant RACK1 using a modification of the
overlay procedure referred to above. The experiment is detailed
in Example 4 and the results are shown in Figure 3.

Peptide I caused an 81±6% inhibition of PKC binding to
recombinant RACK1 as compared with binding in the absence of
added peptide. Both peptides rIII and rVI inhibited the binding
of PKC to RACK1. In addition, peptides rI and rII were also
effective inhibitors of the interaction of PKC with RACK1. A
lesser inhibitory effect was obtained with peptides rIV and rV
and no inhibition was obtained with peptide rVII.
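
The inhibition figures above compare binding in the presence of peptide with binding in the absence of added peptide. A minimal sketch of that arithmetic follows, assuming the bound overlay protein has been quantified as a numeric band intensity. The function name and the sample readings are hypothetical and are not taken from Figure 3.

```python
def percent_inhibition(signal_with_peptide, signal_without_peptide):
    """Percent inhibition of binding relative to the no-peptide control.

    Both arguments are band intensities in the same arbitrary units
    (an assumption; the specification does not state how bands were quantified).
    """
    if signal_without_peptide <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * (1.0 - signal_with_peptide / signal_without_peptide)


# Hypothetical densitometry readings, for illustration only.
control_signal = 1000.0
peptide_signal = 190.0
print(round(percent_inhibition(peptide_signal, control_signal), 1))
```

A value near zero would correspond to no inhibition, as reported for peptide rVII.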

The difference in the peptides' ability to inhibit
binding may reflect differences in the roles played by the
corresponding WD-40 repeats in the protein-protein interactions
between PKC and RACK1. The peptides' ability or inability to
inhibit protein-protein interactions as assayed by an overlay
assay, however, is not necessarily correlated with the effects
those peptides may have on the activity of the targeted
proteins, as measured by both in vivo and in vitro assays and
described in parts D-I below.

D. Effects of peptides homologous to WD-40 regions of
RACK1 on PKC-mediated oocyte maturation.
Peptides I (SEQ ID NO:1), rIII (SEQ ID NO:4) and rVI
(SEQ ID NO:7) were also tested for their ability to affect
insulin-induced, PKC-mediated maturation in Xenopus oocytes, as
detailed in Example 7 and shown in Figures 5A and 5C.

PKC is involved in the maturation of Xenopus oocytes.
Phorbol esters, which activate PKC, or microinjection of a
constitutively active mutant of PKC induce the first stage of
oocyte maturation in the absence of hormones. Exposure to
insulin causes an increase in diacylglycerol levels and
microinjection of activated PKC enhances insulin-induced
maturation (Stith, et al.). Microinjection of purified RACK
proteins causes a significant decrease in the rate of oocyte
maturation (Smith, et al., 1992). The insulin-induced oocyte
maturation assay therefore provides an effective in vivo assay
for compounds that interfere with the function of PKC.
The maturation response was quantified by monitoring
the appearance of a white spot in the animal hemisphere of the
oocyte, indicating germinal vesicle breakdown (GVBD) and
maturation. The indicated peptides were microinjected into
Xenopus oocytes and the percent of oocytes with GVBD following
insulin exposure was plotted as a function of time in Figures 5A
and 5C.
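
The quantification described above reduces to a percentage computed at each observation time. A minimal sketch, assuming the number of oocytes showing GVBD is counted cumulatively at each time point for a group of known size; the function name and the sample counts are hypothetical and are not data from Figure 5.

```python
def gvbd_time_course(cumulative_gvbd_counts, total_oocytes):
    """Convert cumulative GVBD counts into percent of oocytes matured.

    cumulative_gvbd_counts: one count per observation time, in time order.
    total_oocytes: number of oocytes in the injected group.
    """
    if total_oocytes <= 0:
        raise ValueError("total_oocytes must be positive")
    if any(c < 0 or c > total_oocytes for c in cumulative_gvbd_counts):
        raise ValueError("counts must lie between 0 and total_oocytes")
    return [100.0 * c / total_oocytes for c in cumulative_gvbd_counts]


# Hypothetical counts for a group of 20 oocytes at successive time points.
print(gvbd_time_course([0, 5, 10], 20))
```

Plotting such a list against the observation times gives a curve of the kind described for each injected group.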

Approximately 80-85% of sham-injected (control)
oocytes exposed to insulin reach maturation, as compared with
45-50% of oocytes injected with peptide I. The rate of
maturation of those oocytes that did mature was similar in the
two cases. In contrast to the effects of peptide I, both peptides
rIII and rVI potentiated the effects of insulin on oocyte
maturation, both in terms of the rate of maturation and in the
total fraction of oocytes that mature during the experiment.
Injection of peptides rIII or rVI increases the fraction of
maturing oocytes to essentially 100%. Furthermore, peptide rVI
induced oocyte maturation in the absence of insulin stimulation
(Fig. 5B).

Together, the data above indicate that peptides
homologous to the WD-40 region of RACK1 can modulate the
function of a protein with which RACK1 interacts (e.g. PKC),
that the modulation can occur in vivo, and that it can have
clear and profound physiological consequences. Furthermore, the
results with peptide rVI suggest that under appropriate
circumstances, the peptide alone may act to activate PKC, in the
absence of other activating substances.

E. Effects of peptides homologous to WD-40 regions of
RACK1 on PKC translocation in Xenopus oocytes.
Insulin causes the redistribution of βPKC, but not
other PKC isozymes, from a cytosolic form to a membrane-associated
form, as evidenced by the relative levels of PKC in
the soluble vs. the particulate fraction of oocyte homogenate.
To assess the effects of RACK1 WD-40-derived peptides on
insulin-induced PKC translocation, 50 nl of a 20 mM NaCl
solution containing the indicated peptides were microinjected
into Xenopus oocytes. The oocytes were then homogenized, and
the relative amount of PKC in the soluble and particulate
fractions was assayed. The protocol followed was a modification
of a method described by Smith, et al. (1992). The results are
shown in Figure 6.

Peptide I (50 μM) did not affect βPKC distribution in
untreated oocytes, but inhibited insulin-induced βPKC
translocation (Fig. 3, lanes 7,8). In contrast, peptide rVI (50
μM) induced βPKC translocation in the absence of insulin
treatment (Fig. 3, lanes 3,4). These results suggest that
peptide I is an antagonist of hormone-induced PKC translocation,
whereas peptide rVI is an agonist and an activator of PKC
translocation. In light of the results presented in Example 7,
the data also suggest that the inhibition of insulin-induced
GVBD following microinjection of peptide I was due to an
inhibition of βPKC translocation.

F. Effects of peptides homologous to WD-40 regions of
RACK1 on sensitivity of βPKC to Arg-C endopeptidase.
Upon activation of PKC, a pseudosubstrate
autoinhibitory sequence at the N-terminus of PKC dissociates
from the catalytic site and renders the molecule sensitive to
endopeptidase Arg-C (Orr, et al.). Exposure of activated βPKC
to Arg-C results in a limited proteolysis, or "nicking", of the
enzyme. The nicking typically generates a 78 kDa fragment and
several small fragments. Continued exposure to Arg-C typically
results in the disappearance of βPKC (Orr, et al.).

Since peptides rIII (SEQ ID NO:4) and rVI (SEQ ID
NO:7) exhibited PKC agonist activities in other assays (see, for
instance, Examples 7 and 8), experiments were performed to
determine whether the peptides were capable of activating PKC in
a manner to make it susceptible to endopeptidase Arg-C. The
experiments are detailed in Example 9 and the results are shown
in Figure 7.
In the presence of effective concentrations of PKC
activators (0.8 μg/ml DG, 50 μg/ml PS and 1 mM CaCl2), exposure
of βPKC to Arg-C resulted in nicking, generating the 78 kDa
fragment (Fig. 7, lane 2). In the absence of PKC activators,
exposure of βPKC (80 kDa) to endopeptidase Arg-C had no effect
on the enzyme (Fig. 7, lane 1).

Incubation of βPKC with Arg-C at low concentrations of
activators (2.5 μg/ml PS and 50 μM CaCl2) in the absence of added
peptide, in the presence of control peptide (SEQ ID NO:9) and in
the presence of peptide I (SEQ ID NO:1) did not result in
appreciable nicking activity (Fig. 7, lanes 4, 8 and 9,
respectively). However, incubation of βPKC with the same low
concentration of activators in the presence of peptides rIII or
rVI resulted in the appearance of the 78 kDa nicked PKC fragment
(effects of peptide rVI in Fig. 4, lanes 5-7). Concentrations
as low as 10 nM of peptide rVI were sufficient to result in
nicking activity, indicative of βPKC activation.

The results indicate that peptides rIII and rVI, but
not peptide I, are effective to stabilize PKC in an activated
conformation that renders it susceptible to Arg-C under
conditions of low PKC activators that would otherwise not render
the enzyme susceptible to Arg-C.

G. Effects of peptides homologous to WD-40 regions of
RACK1 on βPKC autophosphorylation.

Activated PKC is capable of autophosphorylation, which
can be assayed by incubation with [γ-32P]ATP and visualized on an
autoradiograph of a gel. Anti-pseudosubstrate antibodies were
shown previously to induce autophosphorylation in the absence of
PKC activators (Makowske, et al.). Since peptide rVI (SEQ ID
NO:7) was effective to induce PKC translocation and GVBD in the
absence of PKC activators, experiments were performed to
determine if the peptide was also capable of inducing PKC
autophosphorylation. The experiments are detailed in Example 10
and the data are shown in Figure 8.

PKC in the presence of PS, DG and CaCl2 (1 mM) shows normal
levels of autophosphorylation (lane 1).
No autophosphorylation was seen in the absence of PKC activators
(lane 2), or in the absence of PKC activators with peptide I
(SEQ ID NO:1; lane 5) or control peptide (SEQ ID NO:9; lane 6).
In contrast, peptide rVI in the absence of PKC activators
induced PKC autophosphorylation to over 80% of the levels
obtained for PKC alone in the presence of optimal concentration
of PS, DG, and calcium (compare Fig. 8 lane 1 (control) with
lane 4 (peptide rVI)).

H. Effects of peptides homologous to WD-40 regions of
RACK1 on histone phosphorylation by βPKC.
Another measure of PKC activity is the ability of
activated PKC enzyme to phosphorylate histones. PKC
phosphorylation of histone was carried out using a modification
of the protocol described by Mochly-Rosen, et al. (1987).
Phosphorylation was carried out in the presence or absence of
PKC activators (PS, DG and calcium) and RACK1-derived peptides.
Phosphorylated histone was detected by autoradiography,
following SDS-PAGE on a 10% gel.

Since peptide rVI (SEQ ID NO:7) was effective to
induce the autophosphorylation of PKC in the absence of PKC
activators, and both peptides rIII (SEQ ID NO:4) and rVI
rendered PKC susceptible to proteolysis by Arg-C, experiments
were performed to characterize the effect of the peptides on
histone type III phosphorylation by PKC. The experiments are
detailed in Example 11 and the results are shown in Figures 9
and 10.



The results are similar to those obtained for the
effects of peptide rVI on autophosphorylation of PKC; that is,
peptide rVI was effective to induce PKC-mediated histone
phosphorylation in the absence of the PKC activators PS, DG, and
calcium, once again supporting the view that peptide rVI is an
agonist of PKC activation. Peptide rIII similarly induced histone
phosphorylation (Fig. 10).
VIII. Utility.

A. Peptides as probes for the identification of target
proteins.

WD-40 derived peptides may be used, for example, to
isolate clones encoding target proteins from an expression
library. Variations on the cloning methods described herein can
be used to identify clones expressing sequences capable of
binding the peptides. For example, WD-40 derived peptides may
be used to detect a target protein on a membrane using a
standard binding assay. Positive clones may be detected, for
example, by radiolabeling the peptides and exposing the membrane
to film.

Target proteins isolated in this manner may be 
completely novel, or they may be partially characterized (in 
terms of a biological activity in a homogenate, or a band on a 
protein gel, for example). 

Upon isolation of a cDNA encoding a binding protein,
the cDNA may be expressed, for example, as detailed herein, and
the protein may be characterized. Purified protein thus
isolated may be used for a number of applications, including the
production of antibodies.

Peptides designed according to a method of the present
invention may also be used, for example, as probes in a Western
blot of a tissue homogenate to identify and determine the
molecular weight of known or putative target proteins.

Screens such as those described above may be
facilitated by the modification of peptides used for screening
to incorporate any of a variety of reporter moieties. For
example, the peptides can be radiolabeled with 125I.
Alternatively, the peptides can be modified with a sequence-tag
or a ligand for an affinity column by methods known to those
skilled in the art.

The peptides may also be modified to covalently cross-link
to their targets after binding, for example with any of
various affinity reagents for cross-linking known to those
skilled in the art. This enables the isolation of target
proteins that bind the peptides relatively weakly.
B. Peptides as substitutes for defective WD-40 containing
proteins.

In cases where a WD-40 containing protein is
implicated in a disease (see, for example, Reiner, et al.),
peptides derived from WD-40 regions of the defective protein may
be used as substitutes, for example, to activate a target
enzyme. Such an approach may be more feasible than attempting
therapy with intact proteins. The approach has an additional
advantage in that it does not require knowledge of the
chromosomal location of the affected gene.

The peptides can be introduced into affected cells by
any of several methods known to those skilled in the art,
including through the use of an appropriate expression vector or
through in vitro synthesis and administration by an effective,
expedient route. In vitro studies can be carried out using
skinning or microinjection techniques.

C. Peptides as pharmaceutical agents.

Peptides of the invention may be
designed so as to interact with endogenous target molecules
[text illegible in source]. Alternatively, peptides may
be designed to specifically interact with target molecules
unique to a pathogenic organism.
D. Peptides as modulators of enzymes
involved in protein-protein interactions.
Peptides synthesized according to a method of the
invention may be effective to modulate the function of a target
molecule (e.g. serve as agonists or antagonists). As shown
herein, for example, peptides rIII and rVI can serve to
activate or enhance the activation of PKC, whereas peptide I can
inhibit PKC.

These activities may be used in screens to identify
other compounds which may affect the function of target
molecules such as PKC. In particular, because WD-40 derived
peptides may interact with PKC in a manner that is more similar
to in vivo interactions (i.e. protein binding), they may be
useful for identifying molecules or compounds that may interfere
with PKC function in vivo, but might not necessarily interfere
with PKC in vitro.

For example, peptide rVI can be used to stimulate PKC
in the absence of traditional PKC activators, and the rVI-stimulated
enzyme may be used in a screen to identify, for
example, novel PKC-inhibiting or PKC-potentiating compounds.

If constitutive activation or inactivation of a target
enzyme is desired, peptides may be designed with integrated or
derivatized cross-linking moieties. The peptides can be cross-linked
to their targets upon binding such that the target
molecule assumes the desired state of activity for the lifetime
of the target molecule.

Conversely, as described herein for PKC, peptides may
also be designed so as to accelerate the degradation of the
target molecule. For example, peptide rIII accelerated the
degradation of PKC in cardiac myocytes.

E. WD-40 derived peptides as specific modulators of
isozymes.

Peptides designed according to a method of the present
invention can also be used to provide target isozyme-specific
modulator molecules. For example, most cells have several PKC
isozymes, all of which are activated by the same cellular
stimuli. Determining the function of the individual isozymes is
therefore difficult.

WD-40 derived peptides that selectively stimulate or
inhibit specific target isozymes or groups of isozymes may be
useful, both in terms of therapeutic value and in terms of
determining the roles of different isozymes in cellular function
and disease. Such information can be useful for the
identification of new molecular targets for drug development, as
is described in part F, below.
F. Compounds designed based on the predicted structure of
binding peptides as pharmaceutical agents.
Peptides derived from WD-40 repeats may be useful for
identifying lead compounds for drug development. Peptides as
small as 7 residues have been shown herein to possess specific
bioactivities upon interaction with their targets in vivo. The
structure of such small peptides can be readily determined by a
number of methods, such as NMR and X-ray crystallography. A
comparison of the structures of peptides similar in sequence,
but differing in the biological activities they elicit in the
target molecules, can provide information about the structure-activity
relationship (SAR) of the target enzyme.

For example, peptide I and RACK1-derived peptides rIII
(SEQ ID NO:4) and rVI (SEQ ID NO:7) had opposite effects in vivo,
although they are homologous in sequence.

Information gleaned from the examination of structure-activity
relationships can be used to design either modified
peptides, or other small molecules or lead compounds which can
be tested for predicted properties (e.g. agonist or antagonist),
as related to the target enzyme. The activity of the lead
compounds can be evaluated using assays similar to those used in
the evaluation of peptide-binding effects.

Information relating to a SAR of a target enzyme may
also be obtained from co-crystallization studies. In such
studies, a peptide with a desired activity is crystallized in
association with a target protein, and the X-ray structure of
the complex is determined. The structure can then be compared,
for example, to the structure of the target protein in its
native state, and information from such a comparison may be used
to design compounds expected to possess specific activities.
The compounds can be evaluated using assays similar to those
used in the evaluation of peptide-binding effects.

G. PCR of cDNA corresponding to WD-40 repeats to identify
mutations in WD-40 containing proteins.
Results presented herein suggest that the middle
regions of WD-40 motifs are involved in the association of a
WD-40 protein with its target protein. Because this association
is likely to play a central role in the activity of a
polypeptide complex comprised of interacting proteins, some
genetic diseases may include mutations at these regions of WD-40
containing proteins. Therefore, if a WD-40 containing
protein is implicated in a genetic disorder, it may be possible
to use PCR to amplify DNA from the WD-40 regions to quickly
check if a mutation is contained within one of the WD-40 motifs.
Primers can be made corresponding to either (i) the flanking
regions of each repeat or (ii) the flanking regions of a series
of tandem repeats from the affected gene. Standard sequencing
techniques can be used to determine whether a mutation is
present. This method does not require prior chromosome mapping
of the affected gene and can save time by obviating the need to
sequence the entire gene encoding a defective WD-40 protein.
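
The primer placement described above can be illustrated with a short sketch. It assumes the cDNA sequence is available as a plain string and that the coordinates of a repeat (or of a series of tandem repeats) are already known. The function names, the 0-based end-exclusive coordinate convention and the default primer length of 20 nucleotides are assumptions made for illustration and are not taken from the specification.

```python
# Complement table for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def flanking_primers(cdna, repeat_start, repeat_end, primer_len=20):
    """Pick primers from the sequence immediately flanking a repeat.

    repeat_start and repeat_end are 0-based, end-exclusive coordinates.
    The forward primer is the upstream flank as written; the reverse
    primer is the reverse complement of the downstream flank.
    """
    if repeat_start < primer_len or repeat_end + primer_len > len(cdna):
        raise ValueError("not enough flanking sequence for primers")
    forward = cdna[repeat_start - primer_len:repeat_start].upper()
    reverse = reverse_complement(cdna[repeat_end:repeat_end + primer_len])
    return forward, reverse


# Toy sequence for illustration only: flank, "repeat", flank.
toy = "AAAACCCC" + "GGGG" + "TTTTACGT"
print(flanking_primers(toy, 8, 12, primer_len=4))
```

For option (ii) in the text, the same call would be made with the start of the first repeat and the end of the last repeat in the tandem series. Practical primer design would also weigh melting temperature and specificity, which this sketch ignores.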
H. WD-40 based polypeptides as affinity ligands.

Since the polypeptide compositions of the invention
are able to bind proteins of interest, generically called a
"first protein", the polypeptide compositions can also be used
to retrieve the proteins of interest from samples and the
peptides can be used as affinity ligands for chromatographic
procedures to purify and analyze said proteins. Standard
chromatographic techniques are employed. Typically, the
polypeptide is coupled to a solid support and the sample
putatively containing the first protein is contacted with the
polypeptide composition of the invention; any unbound components
of the sample are removed and, if desired, the first protein,
bound to support, is eluted and recovered.



I. Use of peptides in screening tests for candidates.
Various candidate compounds, not necessarily
polypeptides, may be shown to bind to a first protein using the
polypeptides of the invention as competitors. In these
screening assays, the ability of a candidate compound to bind the
first protein can be assessed by contacting the first protein
with the polypeptide composition of the invention in the
presence and absence of the candidate compound and evaluating
the level of binding of the polypeptide in the presence as
opposed to the absence of the candidate. Decreased binding of
the polypeptide in the presence of the candidate indicates that
the candidate binds to the first protein.

More broadly, the interaction of a protein with a
polypeptide subsequence contained in the second protein can be
assessed by contacting the first protein with a polypeptide
representing the subsequence and observing any interaction with
the polypeptide composition.



IX. Production of the Peptides of the Invention.

The polypeptides of the invention can be prepared 
using standard techniques for the synthesis of peptides from 
amino acids. Such techniques, when conducted in solid phase 
chemistry are available commercially. 

The polypeptides of the invention may also be produced
using recombinant methods. These methods are by now well known
in the art; DNA molecules containing nucleotide sequences
encoding the desired polypeptides can readily be synthesized and
ligated into expression systems for production of the peptides
as is understood in the art. A wide variety of hosts is
available, including procaryotic and eucaryotic hosts. The
construction of expression vectors, means to modify these hosts,
and culturing the modified hosts for recombinant production of
polypeptides are conducted using standard techniques.

The following examples illustrate, but do not limit 
the present invention. 



Materials and Methods 

Nitrocellulose filters were obtained from Schleicher and 
Schuell (Keene, NH) . 

Synthetic peptides were prepared using commercially
available automated peptide synthesizers. Alternatively, custom
designed peptides may be purchased, for example, from Bachem
Bioscience (King of Prussia, PA). Peptides may also be prepared
recombinantly by expressing oligonucleotide sequences encoding the
peptides. The oligonucleotide sequences may be either synthesized
directly by standard methods of oligonucleotide synthesis, or, in
the case of large coding sequences, synthesized by a series of
cloning steps involving a tandem array of multiple oligonucleotide
fragments corresponding to the coding sequence (Crea; Yoshio, et
al.; Eaton, et al.). Oligonucleotide coding sequences can be
expressed by standard recombinant procedures (Maniatis, et al.;
Ausubel, et al.).

"Triton" refers to a nonionic detergent comprising a
polyoxyethylene ether and other surface-active compounds. An
exemplary Triton detergent is "TRITON X-100", available from Sigma
Chemical Company, St. Louis, MO.

"Tween" refers to a nonionic detergent comprising 
polyoxyethylenesorbitan monolaurate with a fatty acid composition
of approximately 55% lauric acid, with a balance composed primarily 
of myristic, palmitic and stearic acids. An exemplary Tween 
detergent is "TWEEN 20", available from Sigma Chemical Company, St. 
Louis, MO.

"SDS" refers to sodium dodecyl sulfate.

"PAGE" refers to polyacrylamide gel electrophoresis.

"IPTG" refers to isopropyl β-D-thiogalactopyranoside.

Example 1

Expression Cloning of a PKC-binding Protein
A. Buffers.

Overlay block buffer: 50 mM Tris-HCl (pH 7.5), 0.2 M 
NaCl, 3% bovine serum albumin (BSA) and 0.1% polyethylene glycol. 

Overlay buffer: 50 mM Tris-HCl (pH 7.5), 0.2 M NaCl, 12
mM 2-mercaptoethanol, 0.1% BSA, 1% polyethylene glycol, 10 μg per
ml soybean trypsin inhibitor and 10 μg per ml leupeptin.
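
When buffers such as these are made up from concentrated stocks, the volume of each stock follows from the dilution relation C1 x V1 = C2 x V2. The sketch below illustrates that calculation only. The stock concentrations (1 M Tris-HCl, 5 M NaCl) and the 100 ml batch size are assumptions chosen for the example and are not stated in the specification.

```python
def stock_volume_ml(final_conc, stock_conc, final_volume_ml):
    """Volume of stock needed so that final_conc is reached in final_volume_ml.

    final_conc and stock_conc must be in the same units (for example, both mM).
    """
    if stock_conc <= 0 or final_conc < 0 or final_volume_ml <= 0:
        raise ValueError("concentrations and volume must be positive")
    if final_conc > stock_conc:
        raise ValueError("final concentration exceeds stock concentration")
    return final_conc * final_volume_ml / stock_conc


# Assumed stocks for a 100 ml batch: 1 M (1000 mM) Tris-HCl and 5 M NaCl.
tris_ml = stock_volume_ml(50, 1000, 100)   # 50 mM Tris-HCl
nacl_ml = stock_volume_ml(0.2, 5, 100)     # 0.2 M NaCl
print(tris_ml, nacl_ml)
```

Components specified as a percentage or as μg per ml would be handled the same way once stock and final values are expressed in matching units.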



B. Isolation of a PKC-binding cDNA clone by an overlay
assay.

A rat brain (Sprague Dawley) cDNA expression library,
constructed in the lambda phage cloning vector "UNI-ZAP XR"
(Stratagene, La Jolla, CA), was screened by an overlay assay as
follows.

Lifts of nitrocellulose filters from IPTG-induced cDNA
library plates were incubated for 2 hours in overlay block buffer.
The filters were then transferred to overlay buffer with or without
1 unit of a mixture of rat brain PKC isozymes (α, β, γ, δ, ε and
ζ, ~10 nM final concentration each) and incubated for 20 minutes
at room temperature with PKC activators (60 μg/ml
phosphatidylserine (PS), 2 μg/ml diacylglycerol (DG), 1 mM CaCl2).

Following three 15 minute washes in the overlay buffer,
the filters were incubated in the overlay block buffer in the
presence of a mixture of monoclonal anti-α, β and γ PKC antibodies
(1:1000 dilution; Seikagaku Kogyo, Tokyo, Japan) and polyclonal
anti-δ, ε and ζ PKC antibodies (1:500 dilution; Life Technologies,
Gaithersburg, MD). After a 16 hr incubation at room temperature,
the filters were washed three times, 15 minutes per wash, in
overlay buffer.

Binding of PKC was determined using alkaline phosphatase-conjugated
goat anti-rabbit or goat anti-mouse antibodies (1:2000
dilution, Boehringer Mannheim Biochemicals, Indianapolis, IN). The
alkaline phosphatase reaction used 5-bromo-4-chloro-3-indolyl
phosphate p-toluidine salt as a substrate, and was performed
following the manufacturer's protocol.

Library screening of 2.4 x 10^6 recombinant "UNI-ZAP"
lambda phage plaques yielded one clone, pRACK1, that reacted with
anti-PKC antibodies in the PKC overlay membrane, but not in the
control overlay membrane. These results suggest that pRACK1
encodes a PKC binding protein.

C. Cloning and sequencing of cDNA from positive plaques.

The clone pRACK1, identified as detailed in part B above,
was plaque purified and cDNA inserts were isolated as phagemids by
in vivo excision of the cloning vector, according to the
manufacturer's protocol (Stratagene, La Jolla, CA). DNA sequencing
of pRACK1 was carried out using standard di-deoxy sequencing
techniques (Maniatis, et al.). The DNA sequence of RACK1 is shown
in Figure 1A. The sequence is also contained in the Sequence
Listing as SEQ ID NO:19.

Example 2

Expression and Purification of Recombinant RACK1 Protein

A PstI/XhoI DNA fragment containing an open reading frame
of 317 amino acids from the putative translation start site of
pRACK1 (see underlined ATG in Fig. 1A) and 8 additional nucleotides
upstream of the initiating methionine was subcloned into E. coli
expression vector pMAL-c2 (New England BioLabs, Beverly, MA). This
vector contains the malE gene, which encodes maltose-binding
protein (MBP). Induction of E. coli containing the vector results
in the production of an MBP-fusion protein (Ausubel, et al.). The
vector also includes a recognition site for the protease factor Xa,
which allows the protein of interest to be cleaved from MBP after
purification without adding any vector-derived residues to the
protein.

A culture of TB1 E. coli transformed with RACK1-containing
pMAL-c2 was induced by a 3 hr incubation with 1.8 mM
IPTG. A protein fraction containing a 78 kDa fusion protein
comprised of RACK1 fused to MBP was isolated from the cultured E.
coli by standard methods (Ausubel). The fusion protein was
purified on an amylose affinity column according to the
manufacturer's protocol (New England BioLabs, Beverly, MA) and
incubated with protease Xa (New England BioLabs) to yield a 36 kDa
protein (RACK1) and a 34 kDa protein (possibly a RACK1 degradation
product).

Example 3

Binding of PKC to Recombinant RACK1
A. Buffers.

PBS/Tween buffer: [text illegible in source] Na2HPO4, 1.5 mM KH2PO4,
3 mM KCl and 0.05% Tween at pH 7.0.

Overlay wash buffer: [text illegible in source] (pH 7.5), 0.2 M NaCl,
12 mM 2-mercaptoethanol, 0.1% polyethylene glycol and 0.1 mM CaCl2.

B. Overlay assay.

Purified recombinant RACK1 protein (100-250 [unit illegible in
source] per lane, prepared as detailed in Example 2) was subjected
to SDS/PAGE and blotted onto nitrocellulose membranes (Ausubel). The
nitrocellulose membranes were cut into strips, which were incubated
for 0.5 hr in overlay buffer (Example 1) in the presence or absence
of [text illegible in source] activators (phosphatidylserine (PS),
diacylglycerol (DG), and CaCl2; concentrations illegible in source).
Unbound material was removed by five washes, 5 min each,
in overlay wash buffer. Where indicated, PKC activators were
present during the incubation of PKC with the nitrocellulose
strips. The conditions for each sample and corresponding results
are presented in part D below.

C. Detection of bound PKC. 

PKC bound to RACK1 immobilized on nitrocellulose strips
was detected as follows. The strips were incubated for 16 hours
at room temperature with a mixture of anti-PKC antibodies as
detailed in part B of Example 1, and then washed three times, 15
minutes per wash, with PBS/Tween buffer. The strips were incubated
with anti-mouse and anti-rabbit horseradish peroxidase-linked
secondary antibodies (Amersham Life Science, Arlington Heights, IL)
diluted 1:1000 in PBS/Tween buffer supplemented with 2% BSA, for 1
hour at room temperature. After washing three times, 15 minutes
per wash, with PBS/Tween buffer, the strips were subjected to a
chemiluminescent reaction with luminol (diacylhydrazide) as
detailed in the manufacturer's protocol (Amersham Life Science,
Arlington Heights, IL), followed by an immediate exposure to
autoradiography film (Eastman Kodak, Rochester, NY) for 30 seconds
to 5 minutes.

D. Effects of PKC activation on PKC binding to RACK1.

The results presented in Figure 2 show the influence of
PKC activators on the binding of PKC to RACK1 immobilized on
nitrocellulose membranes. The overlay assay was carried out as
described in part B above. The test reagents contained in each
sample and the corresponding lanes on the blot presented in Fig.
2 are as follows. Lane 1: PKC, 60 µg/ml PS, 2 µg/ml DG and 1 mM
CaCl2; lane 2: PKC and 1 mM EGTA; lane 3: PKC, 60 µg/ml PS and 2
µg/ml DG; lane 4: PKC and 1 mM CaCl2; lane 5: no PKC added; lanes
6 and 7: PKC, 60 µg/ml PS, 2 µg/ml DG, 1 mM CaCl2, and 10 µM
substrate peptide (SEQ ID NO:11; lane 6) or 10 µM pseudosubstrate
peptide (SEQ ID NO:12; lane 7). The results are representative of
three independent experiments.

It can be appreciated that the binding of PKC as detected
by anti-PKC antibodies is minimal in the presence of EGTA or
calcium alone (Fig. 2, lanes 2, 4, respectively), is greater in the
presence of phosphatidylserine (PS) and diacylglycerol (DG; lane
3), and is maximal in the presence of PS, DG and calcium (lane 1).
Antibody binding was not observed in the absence of added PKC (lane
5). Furthermore, maltose binding protein alone, or an extract from
non-transformed E. coli, did not bind PKC.

The concentration dependence of PKC binding to RACK1 was
characterized with βPKC, since this isozyme is a major component
of the PKC mixture used for the overlay assay. The mean
half-maximal binding was ~0.375 nM, and maximal binding was ~4 nM
(n=3; values reflect binding of the βPKC isozyme in the presence of
other PKC isozymes and were determined by scanning autoradiograms
in the linear range of detection, as described in Mochly-Rosen, et
al., 1991).
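As a rough illustration of how the half-maximal value relates to the reported maximum, the following sketch assumes a simple one-site hyperbolic saturation model. The specification does not state which model, if any, was fitted, so the model and the function name `fractional_binding` are assumptions made only for this example.

```python
def fractional_binding(conc_nM: float, half_max_nM: float) -> float:
    """Fraction of maximal binding under an assumed one-site hyperbolic model.

    conc_nM     : PKC concentration in nM
    half_max_nM : concentration giving half-maximal binding, in nM
    """
    if conc_nM < 0 or half_max_nM <= 0:
        raise ValueError("concentration must be >= 0 and half-max must be > 0")
    return conc_nM / (half_max_nM + conc_nM)


# Half-maximal binding value taken from the text (~0.375 nM).
HALF_MAX_NM = 0.375

if __name__ == "__main__":
    # Illustrative concentrations only; these are not data from the specification.
    for conc in (0.1, 0.375, 1.0, 4.0):
        frac = fractional_binding(conc, HALF_MAX_NM)
        print(f"{conc:5.3f} nM -> {100 * frac:5.1f}% of maximal binding")
```

Under this assumed model, binding at 4 nM is already close to saturation, which is broadly in line with the text describing maximal binding near that concentration.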

The results presented above indicate that in order for
PKC to bind to RACK1 it must be activated. In vitro, activation
may be accomplished, for example, by phosphatidylserine and
diacylglycerol, or, more preferably, by phosphatidylserine,
diacylglycerol and calcium.

Example 4

Inhibition of PKC Binding to RACK1 by RACK1-specific WD-40-homologous
Peptides
Assays for the inhibition of PKC binding to RACKl by 
putative binding peptides were carried out by combining a variation 
of the overlay protocol described in Example 3 part B above, with 
an overlay extraction assay described in part B below. The 
variation in the overlay protocol consisted of incubating the 
putative binding peptides with a mixture of PKC isozymes for 15 
minutes at room temperature before the mixture was used to contact 
the nitrocellulose strips containing immobilized RACKl. 

A. Buffers.

Sample buffer: 0.3 M Tris HCl, 5% SDS, 50% glycerol,
0.01% bromophenol blue and 5% β-mercaptoethanol.

B. Overlay extraction protocol.

Nitrocellulose strips containing immobilized RACK1, that
had been contacted with a solution containing a mixture of PKC
isozymes, were washed and the area corresponding to the 36 kDa
(RACK1-containing) band was cut out. The pieces (containing
PKC/RACK1 complexes) were incubated with sample buffer for 10
minutes at 80°C. The sample buffer and the nitrocellulose pieces
were then placed in wells in the PAGE gel and subjected to SDS-PAGE
to elute the bound proteins. The gel was blotted onto
nitrocellulose and a Western blot analysis was carried out using
the mixture of antibodies (specific for PKC α, β, γ, δ, ε and ζ
isozymes) described in Example 1 part B. Bound antibodies were
detected by 125I-protein A.

C. PKC overlay in the presence of binding peptides.

Peptides derived from or homologous to WD-40 repeats of
RACK1 were tested for their ability to inhibit PKC binding to
recombinant RACK1. Binding of PKC to RACK1 was carried out using
a variation of the overlay procedure described in Example 3 part
B. In the experimental samples, peptides were incubated with a
solution containing a mixture of rat brain PKC isozymes (~10 nM
each) for 15 minutes at room temperature.

Following completion of the modified overlay protocol, 
the samples were subjected to the overlay-extraction protocol 
detailed in part B, above. 

The results in Figure 3 show the binding of PKC to RACK1
carried out without (lane 1) or with (lanes 2-4) a preincubation
of peptides with PKC. Lane 2 shows PKC binding following a
preincubation with 10 µM peptide I (SEQ ID NO:1). Peptide I caused
an 81±6% inhibition of PKC binding to recombinant RACK1 as compared
with binding in the absence of added peptide (n=3). Lanes 3 and
4 show PKC binding following a preincubation with 10 µM peptide
rIII (SEQ ID NO:4) and 10 µM peptide rVI (SEQ ID NO:7),
respectively. Both peptides inhibit the binding of PKC to RACK1.
It can be seen that peptide rIII is somewhat more effective than
peptide rVI. The results shown are representative of three
independent experiments.

The overlay-extraction method (part B above) was used in
experiments relating to the peptide inhibition of PKC binding in
order to decrease the possibility that some part of the inhibition
of PKC binding to RACK1 reflects an interference in the binding of
anti-PKC antibodies to the PKC/RACK1 complexes. Free peptides are
effectively removed from the PKC/RACK1 complexes during the second
round of SDS/PAGE, prior to blotting and detection of immobilized
PKC/RACK1 complexes by anti-PKC antibodies.
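A percent-inhibition figure such as the one reported for peptide I can be computed from band intensities as sketched below. This is a minimal illustration only: the densitometry readings are hypothetical placeholders, the function names are invented for this example, and the specification does not describe the exact quantitation procedure.

```python
import math
from typing import Sequence, Tuple


def percent_inhibition(signal_with_peptide: float, signal_without_peptide: float) -> float:
    """Percent reduction of the bound-PKC signal relative to the no-peptide lane."""
    if signal_without_peptide <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * (1.0 - signal_with_peptide / signal_without_peptide)


def mean_and_sem(values: Sequence[float]) -> Tuple[float, float]:
    """Mean and standard error of the mean (sample standard deviation / sqrt(n))."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two values to estimate the SEM")
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / (n - 1)
    return mean, math.sqrt(variance / n)


if __name__ == "__main__":
    # Hypothetical (with peptide, without peptide) band intensities for n=3 blots.
    readings = [(210.0, 1000.0), (150.0, 980.0), (260.0, 1050.0)]
    inhibitions = [percent_inhibition(w, wo) for w, wo in readings]
    mean, sem = mean_and_sem(inhibitions)
    print(f"inhibition = {mean:.0f} ± {sem:.0f}% (n={len(inhibitions)})")
```

Normalizing each blot to its own no-peptide lane before averaging keeps blot-to-blot differences in exposure from inflating the error estimate.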

Example 5 

Identification of Sequenced Proteins Containing WD-40 Repeats

A search for WD-40 motif-containing proteins was done
using the ENTREZ program, release 6.0 (National Center for
Biotechnology Information, National Library of Medicine, National
Institutes of Health, Bethesda, MD). The ENTREZ database was
searched for protein sequences related to the β subunit of
transducin.

Protein sequences homologous to β-transducin were
examined for the existence of WD-40 repeats, following the guidance
for identification of WD-40 repeats presented in section V of the
specification, above.

The proteins were also used to carry out additional
searches of the database, in order to identify other proteins which
may contain WD-40 repeats, but which might not be homologous to the
β subunit of transducin. Sequences identified during the second
round of searches were again examined for WD-40 repeats.

This search strategy identified 30 proteins containing
WD-40 sequences. The amino acid sequences of these proteins, with
the WD-40 regions aligned and delineated, are shown in Figs. 12-18,
20-27, 29-30, 34-35, 37-38, 40 and 42-50. The sequences are
represented in the Sequence Listing as SEQ ID NO:29-35, 37-44,
46-47, 51-52, 54-55, 57 and 59-67. An examination of the sequences
in the figures reveals that although there can be divergence
between the WD-40 motifs of different proteins, a consistent
pattern can be inferred based on the teachings presented in part
V of the specification above.

An additional search, using a consensus WD-40 sequence
(SEQ ID NO:262), was conducted with the "MACVECTOR" program
(Eastman Kodak Co., New Haven, CT) to search GenBank (December 1993
release). Default settings (matrix=250) were used for the search.
The search identified the 250 proteins with the highest homology
to the consensus sequence. These proteins were examined, as
detailed in part V above, for WD-40 repeats. Ten additional
proteins containing WD-40 repeats were identified with this
strategy. The amino acid sequences of those proteins, with the
WD-40 repeats aligned and delineated, are shown in Figs. 11, 19, 28,
31-33, 36, 39, 41 and 51. The sequences are represented in the
Sequence Listing as SEQ ID NO:28, 36, 45, 48-50, 53, 56, 58 and 68.
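The figure numbers and SEQ ID NOs listed for the second search can be cross-checked mechanically. The sketch below expands the two lists exactly as printed above and tests whether each figure maps to its SEQ ID NO by a constant offset. The helper name `expand` is introduced only for this illustration, and the constant-offset relationship is an observation about these two lists, not a rule stated in the specification.

```python
from typing import List


def expand(ranges: str) -> List[int]:
    """Expand a list such as '11, 19, 31-33' into [11, 19, 31, 32, 33]."""
    numbers: List[int] = []
    for token in ranges.split(","):
        token = token.strip()
        if "-" in token:
            low, high = (int(part) for part in token.split("-"))
            numbers.extend(range(low, high + 1))
        else:
            numbers.append(int(token))
    return numbers


# Lists as given in the text for the consensus-sequence (second) search.
second_search_figs = expand("11, 19, 28, 31-33, 36, 39, 41, 51")
second_search_seq_ids = expand("28, 36, 45, 48-50, 53, 56, 58, 68")

# Offsets between each figure number and the SEQ ID NO in the same position.
offsets = {seq_id - fig for fig, seq_id in zip(second_search_figs, second_search_seq_ids)}

if __name__ == "__main__":
    print(f"{len(second_search_figs)} figures, {len(second_search_seq_ids)} sequences")
    print(f"figure-to-SEQ-ID offsets observed: {sorted(offsets)}")
```

Both lists expand to ten entries, matching the "ten additional proteins" in the text, and every pair differs by the same offset, which is a convenient sanity check when a figure or sequence number is hard to read.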

Example 6 

Binding of βPKC to RACK1 WD-40-derived Peptides
A. Buffers.

Peptide overlay block buffer: 20 mM Tris-HCl (pH 7.5),
2 M NaCl, 3% bovine serum albumin (BSA) and 0.1% polyethylene
glycol.

Overlay wash buffer: 50 mM Tris-HCl (pH 7.5), 0.2 M NaCl,
12 mM 2-mercaptoethanol, 0.1% polyethylene glycol and 0.1 mM CaCl2.

B. PKC overlay of immobilized peptides.

The binding of βPKC to peptide I (SEQ ID NO:1), peptide
rVI (SEQ ID NO:7) and control peptide (SEQ ID NO:9) was assessed
using a PKC overlay assay similar to that described in Example 3.
Increasing amounts of peptides (0.5 µmole, 1.0 µmole, 5.0 µmole and
10.0 µmole) suspended in 20 mM NaCl were applied individually onto
nitrocellulose using a slot-blot apparatus (Schleicher and Schuell,
Keene, NH). The nitrocellulose membrane was washed three times,
15 minutes per wash, in peptide overlay buffer and incubated for
two hours in peptide overlay block buffer. The membrane was cut
into sections and the sections were transferred to different
PKC-containing solutions and incubated for 30 minutes at room
temperature. All the solutions contained 5 nM rat brain PKC in
peptide overlay buffer. Some solutions additionally contained PS,
DG, and calcium. The membranes were then washed three times, 15
minutes per wash, in peptide overlay buffer and incubated in
peptide overlay block buffer containing anti-βPKC monoclonal
antibodies (1:1000 dilution; Seikagaku Kogyo, Tokyo, Japan). After
a 16 hr incubation at room temperature, the filters were washed
three times, 15 minutes per wash, in peptide overlay buffer.

Binding of PKC was determined using chemiluminescence as
described in Example 3, part C. Quantitation of PKC binding was
carried out using a "MICRO SCAN" 1000 gel analyzer (Galai Inc.,
Yokneam, Israel).

The data show that activated PKC bound to both peptides
I and rVI, but not to the control peptide, at peptide amounts as
low as 5 µmoles. Unactivated PKC did not bind to peptide I, but
did bind to peptide rVI at similar concentrations.

The results indicate that peptide rVI is capable of
binding both activated as well as unactivated forms of PKC, whereas
peptide I binds only to activated PKC.



Example 7 

Effects of RACK1 WD-40-derived Peptides on PKC-mediated Oocyte
Maturation

Exposure to insulin induces maturation in Xenopus oocytes
via a PKC-dependent pathway (Smith, et al., 1992). The maturation
response may be quantified by monitoring the appearance of a white
spot in the animal hemisphere of the oocyte, indicating germinal
vesicle breakdown (GVBD) and maturation. To assess the effects of
RACK1 WD-40-derived peptides on insulin-induced PKC-mediated
maturation, 50 nl of a 20 mM NaCl solution containing the indicated
peptides [peptide I (SEQ ID NO:1; •), peptide rVI (SEQ ID NO:7; ■),
or injection solution (□)] (peptides at 50 µM) were microinjected
into Xenopus oocytes. The symbols refer to symbols used in Figure
5, which shows the data from this example. One hour following the
peptide injections, the oocytes were exposed to a solution
containing insulin (8.25 µg/ml) for 2 minutes (t=0). 10-15 oocytes
were used for each sample.
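The amount of peptide delivered per oocyte follows directly from the injection volume and concentration given above. The sketch below works through that arithmetic. The oocyte volume of roughly 1 µl used to estimate the diluted intracellular concentration is an assumption made for illustration and is not a value stated in the specification; the function names are likewise hypothetical.

```python
def injected_picomoles(volume_nl: float, conc_uM: float) -> float:
    """Amount injected, in pmol.

    nl x µM = 1e-9 L x 1e-6 mol/L = 1e-15 mol (fmol); dividing by 1000 gives pmol.
    """
    return volume_nl * conc_uM / 1000.0


def diluted_conc_uM(volume_nl: float, conc_uM: float, oocyte_volume_nl: float) -> float:
    """Concentration after the injected volume mixes evenly with the oocyte volume."""
    return conc_uM * volume_nl / (oocyte_volume_nl + volume_nl)


# Values from the text: 50 nl of a 50 µM peptide solution.
INJECTION_NL = 50.0
PEPTIDE_UM = 50.0

# ASSUMPTION (not from the specification): oocyte volume of about 1 µl = 1000 nl.
ASSUMED_OOCYTE_NL = 1000.0

if __name__ == "__main__":
    amount = injected_picomoles(INJECTION_NL, PEPTIDE_UM)
    final = diluted_conc_uM(INJECTION_NL, PEPTIDE_UM, ASSUMED_OOCYTE_NL)
    print(f"injected amount: {amount} pmol per oocyte")
    print(f"estimated intracellular concentration: {final:.2f} µM (assumed volume)")
```

The injected amount (2.5 pmol) depends only on figures in the text, whereas the intracellular estimate scales with whatever oocyte volume is assumed.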

The data, representative of three independent
experiments, are expressed as the percent of oocytes with GVBD
following insulin exposure and are plotted as a function of time
in Figure 5.

In oocytes injected with buffer or control peptide, onset
of maturation was typically 4-5 hours after exposure to insulin.
Following this delay, %GVBD followed an approximately exponential
time-course, reaching a plateau of about 85-90% GVBD at about 10-12
hours. These data indicate that approximately 80-85% of sham-
injected oocytes exposed to insulin at t=0 reach maturation, and
that maturation is reached relatively quickly (within about 10
hours) relative to the time-course of the experiment (20 hours).

Oocytes injected with peptide I (SEQ ID NO:1) responded
in a manner similar to control oocytes, except the plateau was at
about 45-50% GVBD. These data suggest that injection of peptide
I blocked maturation in approximately 40-45% of oocytes that would
normally proceed to maturation, but had little effect on the
kinetics or extent of maturation of the remaining (50-55%) oocytes.
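The estimate of the fraction of oocytes blocked by peptide I can be reproduced from the two plateau ranges. The sketch below computes the bounds implied by the reported ranges (85-90% for controls, 45-50% for peptide I). It is illustrative arithmetic only; the function name is hypothetical and the specification does not say how its 40-45% figure was derived.

```python
from typing import Tuple


def blocked_fraction_bounds(
    control_plateau: Tuple[float, float],
    treated_plateau: Tuple[float, float],
) -> Tuple[float, float]:
    """Lower and upper bounds (in %) on the share of would-be-maturing oocytes
    that fail to mature, given (low, high) plateau %GVBD ranges."""
    control_low, control_high = control_plateau
    treated_low, treated_high = treated_plateau
    if min(control_low, control_high) <= 0:
        raise ValueError("control plateau must be positive")
    # Smallest block: highest treated plateau against lowest control plateau.
    lower = 100.0 * (1.0 - treated_high / control_low)
    # Largest block: lowest treated plateau against highest control plateau.
    upper = 100.0 * (1.0 - treated_low / control_high)
    return lower, upper


if __name__ == "__main__":
    # Plateau ranges taken from the text.
    low, high = blocked_fraction_bounds((85.0, 90.0), (45.0, 50.0))
    print(f"peptide I blocks roughly {low:.0f}-{high:.0f}% of maturing oocytes")
```

The resulting interval brackets the approximate figure quoted in the text, which suggests that figure is a rounded relative reduction rather than a simple difference in percentage points.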

Oocytes injected with peptide rVI (SEQ ID NO:7) responded
with a slightly shorter delay (about 3-4 hours), but reached a
higher plateau (about 95-100% GVBD) more quickly (within about 5
hours) than control oocytes. These data suggest that peptide rVI
potentiates the effects of insulin on oocyte maturation, both in
terms of the rate of maturation, and in the total fraction of
oocytes that mature during the experiment. Injection of peptide
rVI increases the maturing fraction to essentially 100%.
The effects of both peptides I and rVI on GVBD were
dose-dependent between 5 µM and 500 µM.

Since peptide rVI enhanced insulin-induced GVBD,
experiments were performed to determine whether peptide rVI can
induce GVBD in the absence of insulin. The data from these
experiments are shown in Fig. 5B. Microinjection of peptide rVI
(50 µM) alone, but not peptide I, control peptide or buffer,
induced GVBD. Maturation initiated with a longer delay (about 6-7
hours) than in the control insulin-induced oocytes in Fig. 5A
(about 4-5 hours), and reached a plateau of about 50% GVBD.
Together, the data above indicate that peptides
homologous to the WD-40 region of RACK1 modulate the function of
PKC. Peptide I inhibited PKC-mediated oocyte maturation by about
40%, whereas peptide rVI potentiated insulin-induced maturation
and resulted in a limited maturation response even in the absence
of insulin. The latter result suggests that peptide rVI, under
appropriate circumstances, may act to activate PKC in the absence
of other activating substances.

Example 8

Effects of RACK1 WD-40-derived Peptides on PKC Translocation in
Xenopus Oocytes

A. Buffers.

Homogenization buffer: 20 mM Tris HCl, pH 7.5, 10 mM
EGTA, 2 mM EDTA, 0.25 M sucrose, 10 µM phenylmethylsulfonyl fluoride,
20 µg/ml of each leupeptin and soybean trypsin inhibitor.

B. PKC translocation in oocytes. 

Insulin causes the translocation of βPKC, but not other
PKC isozymes, from a cytosolic form to a membrane-associated form,
as evidenced by the relative levels of PKC in the soluble vs. the
particulate fraction of oocyte homogenate. To assess the effects
of RACK1 WD-40-derived peptides on insulin-induced PKC
translocation, 50 nl of a 20 mM NaCl solution containing the
indicated peptides were microinjected into Xenopus oocytes. The
oocytes were then homogenized, and the relative amount of PKC in
the soluble and particulate fractions was assayed. The protocol
followed was a modification of a method described by Smith, et al.
(1992). The results are shown in Figure 6.

Batches of 50 oocytes were microinjected with either
peptide rVI (SEQ ID NO:7; 50 µM; lanes 3, 4), peptide I (SEQ ID
NO:1; 50 µM; lanes 7, 8) or injection solution (NaCl 20 mM; lanes
1, 2 and 5, 6). Homogenates from each batch were prepared 60
minutes after microinjection (lanes 1-4) or 60 minutes after
addition of insulin (lanes 5-8). The homogenates were centrifuged
at 10,000 g for 3 minutes, the upper layer (containing fat and
yolk) was removed, and the remainder was frozen at -70°C. Prior
to use, the samples were thawed, 200 µl homogenization buffer was
added and the samples were centrifuged at 100,000 g for 30 minutes
at 4°C. The supernatants (soluble fraction) were removed and
concentrated to 20 µl using "CENTRICON" concentrators
(Amicon, Beverly, MA). The pellets (particulate fractions) were
dissolved in 20 µl of homogenization buffer. The samples were
resolved on an 8% SDS/PAGE gel and blotted onto nitrocellulose.
The amount of PKC in each fraction was determined by Western blot
using anti-βPKC antibodies (1:1000 dilution; Seikagaku Kogyo,

the WD-40 region of RACK1 alter the sensitivity of βPKC to
endopeptidase Arg-C.

The methods used to assay Arg-C sensitivity are a
modification of methods described by Orr, et al. Rat brain PKC
(~5 nM) was incubated at room temperature in 500 µl of 20 mM Tris-HCl
buffer (pH 7.5) alone or with Arg-C (5 units/ml) in the presence
or absence of the indicated peptides (final concentration 10 µM or
as indicated), PS, DG, and calcium (as indicated). 50 µl aliquots
were removed into 20 µl of sample buffer during the reaction as
indicated (samples in all the lanes were incubated for 30 minutes,
except lanes 5 and 6, which were incubated for 5 and 15 minutes,
respectively). The samples were boiled for 10 minutes at 80°C and
loaded onto 8% SDS-PAGE. βPKC was detected by Western blot
analysis using anti-βPKC antibodies as described in Examples 6 and
8.

The results are shown in Figure 7. PKC was incubated for
the indicated time alone (lane 1) or in the presence of Arg-C
(lanes 2-9), with DG (0.8 µg/ml), PS (50 µg/ml) and CaCl2 (1 mM;
lane 2), with PS (50 µg/ml) and CaCl2 (1 mM; lane 3), with PS (2.5
µg/ml) and CaCl2 (50 µM; lane 4); with PS (2.5 µg/ml), CaCl2 (50 µM)
and with either peptide rVI (SEQ ID NO:7; 10 µM; lanes 5-7),
control peptide (SEQ ID NO:9; lane 8) or with peptide I (SEQ ID
NO:1; lane 9).

Incubation of βPKC with Arg-C at low concentrations of
activators (2.5 µg/ml PS and 50 µM CaCl2) in the absence of added
peptide did not result in appreciable nicking activity (Fig. 7,
lane 4). Similarly, nicking of βPKC did not occur in the presence
of this concentration of activators with peptide I (lane 9) or with
control peptide (lane 8). However, incubation of βPKC with the
same concentration of activators in the presence of peptide rVI
resulted in a time-dependent appearance of the 78 kDa nicked PKC
fragment (Fig. 7, lanes 5-7). Concentrations as low as 10 nM of
peptide rVI were sufficient to result in nicking activity
indicative of βPKC activation. The results indicate that peptide
rVI, but not peptide I, is effective to stabilize PKC in an
activated conformation that renders it susceptible to Arg-C under
conditions of low PKC activators that would otherwise not render
the enzyme susceptible to Arg-C.

Example 10

Effects of RACK1 WD-40-derived Peptides on PKC
Autophosphorylation

Activated PKC is capable of autophosphorylation. Since
peptide rVI (SEQ ID NO:7) was effective to induce PKC translocation
and GVBD in the absence of an activator such as insulin, the
ability of the peptide to induce PKC autophosphorylation in the
absence of PKC activators was assessed.

PKC autophosphorylation in the presence of βPKC
pseudosubstrate antibodies or the indicated peptides was carried
out using a modification of the method described by Makowske, et
al. Anti-pseudosubstrate antibodies, which were shown previously
to induce autophosphorylation in the absence of PKC activators
(Makowske, et al.), were used as a positive control. The results
are shown in Figure 8.

Rat brain PKC (~10 nM) was incubated with mild agitation
in a final volume of 250 µl of overlay buffer, as in Example 1,
either with anti-βPKC pseudosubstrate antibodies (1:10 dilution,
Life Technologies, Gaithersburg, MD) or with the indicated peptide
(10 µM). Where indicated, PS (50 µg/ml), DG (0.8 µg/ml) and CaCl2
(1 mM) were also added. The amount of autophosphorylation was
determined after 2 hours for the reaction with the anti-
pseudosubstrate antibodies, or after 15 minutes for the other
samples. 50 µl of a buffer comprised of 20 mM Tris-HCl (pH 7.5),
20 mM MgCl2, 20 µM ATP and 5 µCi/ml [γ-32P]ATP was then added. The
mixture was incubated for 15 minutes at room temperature and the
reaction was stopped by adding 60 µl sample buffer (see Example 9).
The samples were then boiled for 10 minutes, loaded onto a 10%
SDS-PAGE mini gel and electrophoresed. The gel was fixed with 50%
methanol and 10% acetic acid for 1 hour, and the autophosphorylation
of PKC was determined by autoradiography.

The results in Figure 8 show PKC autophosphorylation in
the presence of DG, PS, and calcium (lane 1), in the presence of
EGTA (lane 2), in the presence of anti-βPKC pseudosubstrate
antibodies (diluted 1:10 in 20 mM Tris-HCl; lane 3), in the
presence of peptide rVI (SEQ ID NO:7; 10 µM; lane 4), in the
presence of peptide I (SEQ ID NO:1; 10 µM; lane 5), or in the
presence of control peptide (SEQ ID NO:9; 10 µM; lane 6).

Peptide rVI in the absence of PKC activators induced PKC
autophosphorylation to over 80% of the autophosphorylation obtained
in the presence of optimal concentrations of PS, DG, and calcium
(compare Fig. 8 lane 1 (control) with lane 4 (peptide rVI)).
Neither peptide I nor control peptide induced PKC
autophosphorylation in the absence of PKC activators (Fig. 8 lanes
5 and 6, respectively).



Example 11

Effects of RACK1 WD-40-derived Peptides on Histone
Phosphorylation by PKC
Incubation of PKC with peptide rVI (SEQ ID NO:7) induced
histone phosphorylation by PKC. The method used was a modification
of the protocol described by Mochly-Rosen, et al. (1987). The
results are shown in Figure 9.

Histone type IIIs (Sigma Chemical Company, St. Louis, MO)
was phosphorylated by PKC (~10 nM) in the absence (lane 1) and
presence of peptide rVI (10 µM) (lanes 2 and 3) and in the presence
and absence of DG (0.8 µg/ml), PS (50 µg/ml) and CaCl2 (1 mM)
(lane 3). The results are expressed as percentage of control, that
is, the amount of histone phosphorylation by PKC in the presence of
DG (0.8 µg/ml), PS (50 µg/ml) and CaCl2 (1 mM). The results are
the average ± SEM of two independent experiments. PKC was first
incubated with the peptide rVI (10 µM) for 15 minutes in overlay
buffer as described above. Histone type IIIs (40 µg/ml) was added
in Tris-HCl (20 mM), MgCl2 (20 mM), ATP (20 µM) and [γ-32P]ATP (5
µCi/ml) with or without PS (50 µg/ml), DG (0.8 µg/ml) and CaCl2 (1
mM). Histone phosphorylation was determined by autoradiography as
above.

PKC activators PS, DG, and calcium were not required for
either peptide rVI-induced autophosphorylation or histone
phosphorylation, suggesting that peptide rVI is an agonist of PKC
activation.

In a related experiment, phosphorylation of histone
IIIs (25 µM) by PKC (10 nM) was not inhibited by RACK1; rather, a
4.5±0.1 fold increase of histone phosphorylation occurred when co-
incubated with ~100 nM RACK1 (n=2).

SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: Mochly-Rosen, Daria
Ron, Dorit

(ii) TITLE OF INVENTION: WD-40-Derived Peptides and Uses Thereof

(iii) NUMBER OF SEQUENCES: 265

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Dehlinger & Associates
(B) STREET: P.O. Box 60850
(C) CITY: Palo Alto
(D) STATE: CA
(E) COUNTRY: USA
(F) ZIP: 94306-0850

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: 08/190,802
(B) FILING DATE: 01-FEB-1994
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Fabian, Gary R.
(B) REGISTRATION NUMBER: 33,875
(C) REFERENCE/DOCKET NUMBER: 8600-0139

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (415) 324-[illegible]
(B) TELEFAX: (415) 324-0960



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: Peptide I

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

Lys Gly Asp Tyr Glu Lys Ile Leu Val Ala Leu Cys Gly Gly Asn
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: Peptide rI, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Val Thr Gln Ile Ala Thr Thr
1               5

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: Peptide rII, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

Phe Val Ser Asp Val Val Ile
1               5

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: Peptide rIII, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Asp Val Leu Ser Val Ala Phe
1               5

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: Peptide rIV, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

Val Ser Cys Val Arg Phe Ser
1               5

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: Peptide rV, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

Gly Tyr Leu Asn Thr Val Thr
1               5

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



PCT/US95/01210



(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: Peptide rVI, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Asp Ile Ile Asn Ala Leu Cys Phe
1               5

(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: Peptide rVII, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

Pro Gln Cys Thr Ser Leu Ala
1               5

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: control peptide 1, homol. to RACK1
261-266, LKGKIL

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

Leu Lys Gly Lys Ile Leu
1               5



(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: control peptide 2, iden. to RACK1,
265 to 270 IIVDEL

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

Ile Ile Val Asp Glu Leu
1               5
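The isolate descriptions for the two control peptides give one-letter sequences (LKGKIL and IIVDEL), while the sequence descriptions use three-letter codes. A short sketch such as the following, using the standard amino acid code table, can be used to check that the two notations agree. The function name `to_one_letter` is introduced here purely for illustration.

```python
# Standard three-letter to one-letter codes for the 20 common amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_sequence: str) -> str:
    """Convert a whitespace-separated three-letter sequence to one-letter code.

    Raises KeyError on an unrecognized residue, which is useful for catching
    transcription errors such as 'lie' or 'He' in place of 'Ile'.
    """
    return "".join(THREE_TO_ONE[residue] for residue in three_letter_sequence.split())


if __name__ == "__main__":
    # Control peptides, SEQ ID NO:9 and SEQ ID NO:10.
    print(to_one_letter("Leu Lys Gly Lys Ile Leu"))   # LKGKIL
    print(to_one_letter("Ile Ile Val Asp Glu Leu"))   # IIVDEL
```

Because the lookup is strict, any residue that has been mis-transcribed fails loudly instead of silently producing a wrong one-letter string.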

(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: PKC substrate peptide, (Ser25)
PKC(19-36)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

Arg Phe Ala Arg Lys Gly Ser Leu Arg Gln Lys Asn Val His Glu Val
1               5                   10                  15

Lys Asn

(2) INFORMATION FOR SEQ ID NO:12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: PKC Pseudosubstrate Inhibitor
(PKC(19-36))

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

Arg Phe Ala Arg Lys Gly Ala Leu Arg Gln Lys Asn Val His Glu Val
1               5                   10                  15

Lys Asn



(2) INFORMATION FOR SEQ ID NO:13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: GBH Peptide rI, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

Trp Val Thr Gln Ile Ala Thr Thr Pro Gln Phe Pro Asp Met Ile
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO:14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: GBH Peptide rII, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

Phe Val Ser Asp Val Val Ile Ser Ser Asp Gly Gln Phe Ala Leu
1               5                   10                  15

(2) i:7F0F^w\TI0i; FCR SZQ ID 170:15: 

(i) SEQ'JZNCE CHARACTERISTICS: 
40 (A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(D) T0P03J0GY: unknown 






(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: GBH Peptide rIII, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Asp Val Leu Ser Val Ala Phe Ser Ser Asp Asn Arg Gln Ile Val
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GBH Peptide rIV, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser Ser Asn Pro Ile
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: GBH Peptide rV, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser Leu Cys Ala
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GBH Peptide rVI, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Thr Leu Asp Gly Gly Asp Ile Ile Asn Ala Leu Cys Phe Ser Pro
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1115 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: RACK1 DNA Sequence, Fig. 1A

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

GGCACGAGGG GTCGCGGTGG CAGCCGTGCG GTGCTTGGCT CCCTAAGCTA TCCGGTGCCA 60

TCCTTGTCGC TGCGGCGACT CGCAACATCT GCAGCCATGA CCGAGCAAAT GACCCTTCGT 120

GGGACCCTCA AGGGCCATAA TGGATGGGTT ACACAGATCG CCACCACTCC GCAGTTCCCG 180

GACATGATCC TGTCGGCGTC TCGAGACAAG ACCATCATCA TGTGGAAGCT GACCAGGGAT 240

GAGACCAACT ACGGCATACC ACAACGTGCT CTTCGAGGTC ACTCCCACTT TGTTAGCGAT 300

GTTGTCATCT CCTCTGATGG CCAGTTTGCC CTCTCAGGCT CCTGGGATGG AACCCTACGC 360

CTCTGGGATC TCACAACGGG CACTACCACG AGACGATTTG TCGGCCACAC CAAGGATGTG 420

CTGAGCGTGG CTTTCTCCTC TGACAACCGG CAGATTGTCT CTGGGTCCCG AGACAAGACC 480

ATTAAGTTAT GGAATACTCT GGGTGTCTGC AAGTACACTG TCCAGGATGA GAGTCATTCA 540

GAATGGGTGT CTTGTGTCCG CTTCTCCCCG AACAGCAGCA ACCCTATCAT CGTCTCCTGC 600

GGATGGGACA AGCTGGTCAA GGTGTGGAAT CTGGCTAACT GCAAGCTAAA GACCAACCAC 660

ATTGGCCACA CTGGCTATCT GAACACAGTG ACTGTCTCTC CAGATGGATC CCTCTGTGCT 720

TCTGGAGGCA AGGATGGCCA GGCTATGCTG TGGGATCTCA ATGAAGGCAA GCACCTTTAC 780

ACATTAGATG GTGGAGACAT CATCAATGCC TTGTGCTTCA GCCCCAACCG CTACTGGCTC 840

TGTGCTGCCA CTGGCCCCAG TATCAAGATC TGGGACTTGG AGGGCAAGAT CATGGTAGAT 900

GAACTGAAGC AAGAAGTTAT CAGCACCAGC AGCAAGGCAG AGCCACCCCA GTGTACCTCT 960

TTGGCTTGGT CTGCTGATGG CCAGACTCTG TTTGCTGGCT ATACCGACAA CTTGGTGCGT 1020

GTATGGCAGG TCACTATTGG TACCCGCTAA AAGTTTATGA CAGACTCTTA GAAATAAACT 1080

GGCTTTCTGA AATUUIAAAAA AAAAAAAAAA AAAAA

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 96 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 rI DNA Sequence, Fig. 1A

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

GGCCATAATG GATGGGTTAC ACAGATCGCC ACCACTCCGC AGTTCCCGGA CATGATCCTG 60

TCGGCGTCTC GAGACAAGAC CATCATCATG TGGAAG 96

(2) INFORMATION FOR SEQ ID NO:21:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 94 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 rII DNA Sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

GGTCACTCCC ACTTTGTTAG CGATGTTGTC ATCTCCTCTG ATGGCCAGTT TGCCCTCTCA 60

GGCTCCTGGG ATGGAACCCT ACGCCTCTGG GATC 94
(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 93 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 rIII DNA Sequence, Fig. 1A

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

GGCCACACCA AGGATGTGCT GAGCGTGGCT TTCTCCTCTG ACAACCGGCA GATTGTCTCT 60

GGGTCCCGAG ACAAGACCAT TAAGTTATGG AAT 93

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 99 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 rIV DNA Sequence, Fig. 1A

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:

AGTCATTCAG AATGGGTGTC TTGTGTCCGC TTCTCCCCGA ACAGCAGCAA CCCTATCATC 60

GTCTCCTGCG GATGGGACAA GCTGGTCAAG GTGTGGAAT 99



(2) INFORMATION FOR SEQ ID NO:24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 93 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 rV DNA Sequence, Fig. 1A

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

GGCCACACTG GCTATCTGAA CACAGTGACT GTCTCTCCAG ATGGATCCCT CTGTGCTTCT 60

GGAGGCAAGG ATGGCCAGGC TATGCTGTGG GAT 93



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 93 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 rVI DNA Sequence, Fig. 1A

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

TTAGATGGTG GAGACATCAT CAATGCCTTG TGCTTCAGCC CCAACCGCTA CTGGCTCTGT 60

GCTGCCACTG GCCCCAGTAT CAAGATCTGG GAC 93

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 rVII DNA Sequence, Fig. 1A

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:

AGCAAGGCAG AGCCACCCCA GTGTACCTCT TTGGCTTGGT CTGCTGATGG CCAGACTCTG 60

TTTGCTGGCT ATACCGACAA CTTGGTGCGT GTATGGCAG 99

(2) INFORMATION FOR SEQ ID NO:27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 317 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 Amino Acid Sequence, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Met Thr Glu Gln Met Thr Leu Arg Gly Thr Leu Lys Gly His Asn Gly
1 5 10 15

Trp Val Thr Gln Ile Ala Thr Thr Pro Gln Phe Pro Asp Met Ile Leu
20 25 30

Ser Ala Ser Arg Asp Lys Thr Ile Ile Met Trp Lys Leu Thr Arg Asp
35 40 45

Glu Thr Asn Tyr Gly Ile Pro Gln Arg Ala Leu Arg Gly His Ser His
50 55 60

Phe Val Ser Asp Val Val Ile Ser Ser Asp Gly Gln Phe Ala Leu Ser
65 70 75 80

Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp Leu Thr Thr Gly Thr
85 90 95

Thr Thr Arg Arg Phe Val Gly His Thr Lys Asp Val Leu Ser Val Ala
100 105 110

Phe Ser Ser Asp Asn Arg Gln Ile Val Ser Gly Ser Arg Asp Lys Thr
115 120 125

Ile Lys Leu Trp Asn Thr Leu Gly Val Cys Lys Tyr Thr Val Gln Asp
130 135 140

Glu Ser His Ser Glu Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser
145 150 155 160

Ser Asn Pro Ile Ile Val Ser Cys Gly Trp Asp Lys Leu Val Lys Val
165 170 175

Trp Asn Leu Ala Asn Cys Lys Leu Lys Thr Asn His Ile Gly His Thr
180 185 190

Gly Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser Leu Cys Ala
195 200 205

Ser Gly Gly Lys Asp Gly Gln Ala Met Leu Trp Asp Leu Asn Glu Gly
210 215 220

Lys His Leu Tyr Thr Leu Asp Gly Gly Asp Ile Ile Asn Ala Leu Cys
225 230 235 240

Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala Thr Gly Pro Ser Ile
245 250 255

Lys Ile Trp Asp Leu Glu Gly Lys Ile Ile Val Asp Glu Leu Lys Gln
260 265 270

Glu Val Ile Ser Thr Ser Ser Lys Ala Glu Pro Pro Gln Cys Thr Ser
275 280 285

Leu Ala Trp Ser Ala Asp Gly Gln Thr Leu Phe Ala Gly Tyr Thr Asp
290 295 300

Asn Leu Val Arg Val Trp Gln Val Thr Ile Gly Thr Arg
305 310 315






(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Human 55 kDa protein (PWP homolog),
Fig. 11



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Asn Arg Ser Arg Gln Val Thr Cys Val Ala Trp Val Arg Cys Gly
1 5 10 15

Val Ala Lys Glu Thr Pro Asp Lys Val Glu Leu Ser Lys Glu Glu Val
20 25 30

Lys Arg Leu Ile Ala Glu Ala Lys Glu Lys Leu Gln Glu Glu Gly Gly
35 40 45

Gly Ser Asp Glu Glu Glu Thr Gly Ser Pro Ser Glu Asp Gly Met Gln
50 55 60

Ser Ala Arg Thr Gln Ala Arg Pro Arg Glu Pro Leu Glu Asp Gly Asp
65 70 75 80

Pro Glu Asp Asp Arg Thr Leu Asp Asp Asp Glu Leu Ala Glu Tyr Asp
85 90 95

Leu Asp Lys Tyr Asp Glu Glu Gly Asp Pro Asp Ala Glu Thr Leu Gly
100 105 110

Glu Ser Leu Leu Gly Leu Thr Val Tyr Gly Ser Asn Asp Gln Asp Pro
115 120 125

Tyr Val Thr Leu Lys Asp Thr Glu Gln Tyr Glu Arg Glu Asp Phe Leu
130 135 140

Ile Lys Pro Ser Asp Asn Leu Ile Val Cys Gly Arg Ala Glu Gln Asp
145 150 155 160

Gln Cys Asn Leu Glu Val His Val Tyr Asn Gln Glu Glu Asp Ser Phe
165 170 175

Tyr Val His His Asp Ile Leu Leu Ser Ala Tyr Pro Leu Ser Val Glu
180 185 190

Trp Leu Asn Phe Asp Pro Ser Pro Asp Asp Ser Thr Gly Asn Tyr Ile
195 200 205

Ala V^l Gly .\jn Mei r-r 7ro Vzl Glu Vil Trp Acy Liu Asp lie
210 215 220

Val Asp Ser Leu Glu Pro Val Phe Thr Leu Gly Ser Lys Leu Ser Lys
225 230 235 240

Lys Lys Lys Lys Lys Gly Lys Lys Ser Ser Ser Ala Glu Gly His Thr
245 250 255

Asp Ala Val Leu Asp Leu Ser Trp Asn Lys Leu Ile Arg ?.2n Val Leu
260 265 270

Ala Ser Ala Ser Ala Asp Asn Thr Val Ile Leu Trp Asp Met Ser Leu
275 280 285

Gly Lys Pro Ala Ala Ser Leu Ala Val His Thr Asp Lys Val Gln Thr
290 295 300

Leu Gln Phe His Pro Phe Glu Ala Gln Thr Leu Ile Ser Gly Ser Tyr
305 310 315 320

Asp Lys Ser Val Ala Leu Tyr Asp Cys Arg Ser Pro Asp Glu Ser His
325 330 335

Arg Met Trp Arg Phe Ser Gly Gln Ile Glu Arg Val Thr Trp Asn His
340 345 350

Phe Ser Pro Cys His Phe Leu Ala Ser Thr Asp Asp Gly Phe Val Tyr
355 360 365

Asn Leu Asp Ala Arg Ser Asp Lys Pro Ile Phe Thr Leu Asn Ala His
370 375 380

Asn Asp Glu Ile Ser Gly Leu Asp Leu Ser Ser Gln Ile Lys Gly Cys
385 390 395 400

Leu Val Thr Ala Ser Ala Asp Lys Tyr Val Lys Ile Trp Asp Ile Leu
405 410 415

Gly Asp Arg Pro Ser Leu Val His Ser Arg Asp Met Lys Met Gly Val
420 425 430

Leu Phe Cys Ser Ser Cys Cys Pro Asp Leu Pro Phe Ile Tyr Ala Phe
435 440 445

Gly Gly Gla Lys Glu Gly Leu Arg Val Trp Asp Ile Ser Thr Val Ser
450 455 460

Ser Val Asn Glu Ala Phe Gly Arg Arg Glu Arg Leu Val Leu Gly Ser
465 470 475 480

Ala Arg Asn Ser Ser Ile Ser Gly Pro Phe Gly Ser Arg Ser Ser Asp
485 490 495

Thr Pro Met Glu Ser
500

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 428 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: AAC-RICH protein, Fig. 12

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:

Pro Gly Gly Phe Gln His Leu Gln Gln Gln Gln Gln Gln Gln Gln Gln
1 5 10 15

Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Thr Gln Val Gln
20 25 30

Gln Leu His Asn Gln Leu His Gln Gln His Asn Gln Gln Ile Gln Gln
35 40 45

Gln Ala Gln Ala Thr Gln Gln His Leu Gln Thr Gln Gln Tyr Leu Gln
50 55 60

Ser Gln Ile His Gln Gln Ser Gln Gln Ser Gln Leu Ser Asn Asn Leu
65 70 75 80

Asn Ser Asn Ser Lys Glu Ser Thr Asn Ile Pro Lys Thr Asn Thr Gln
85 90 95

Tyr Thr Asn Phe Asp Ser Lys Asn Leu Asp Leu Ala Ser Arg Tyr 51; >
100 105 110

Ser Glu Cys Ser Thr Lys Asp Phe Ile Gly Asn Lys Lys Lys Ser Thr
115 120 125

Ser Val Ala Trp Asn Ala Asn Gly Thr Lys Ile Ala Ser Ser Gly Ser
130 135 140

Asp Gly Ile Val Arg Val Trp Asn Phe Asp Pro Leu Gly Asn Ser Asn
145 150 155 160

Asn Asn Asn Asn Ser Asn Asn Thr Ser Ser Asn Ser Lys Asn Asn Asn
165 170 175

Ile Lys Glu Thr Ile Glu Leu Lys Gly His Asp Gly Ser Ile Glu Lys
180 185 190

Ile Ser Trp Ser Pro Lys Asn Asn Asp Leu Leu Ala Ser Ala Gly Thr
195 200 205

Asp Lys Val Ile Lys Ile Trp Asp Val Lys Ile Gly Lys Cys Ile Gly
210 215 220

Thr Val Ser Thr Asn Ser Glu Asn Ile Asp Val Arg Trp Ser Pro Asp
225 230 235 240

Gly Asp His Leu Ala Leu Ile Asp Leu Pro Thr Ile Lys Thr Leu Lys
245 250 255

Ile Tyr Lys Phe Asn Gly Glu Glu Leu Asn Gln Val Gly Trp Asp Asn
260 265 270

Asn Gly Asp Leu Ile Leu Met Ala Asn Ser Met Gly Asn Ile Glu Ala
275 280 285

Tyr Lys Phe Leu Pro Lys Ser Thr Thr His Val Lys His Leu Lys Thr
290 295 300

Leu Tyr Gly His Thr Ala Ser Ile Tyr Cys Met Glu Phe Asp Pro Thr
305 310 315 320

Gly Lys Tyr Leu Ala Ala Gly Ser Ala Asp Ser Ile Val Ser Leu Trp
325 330 335

-\sp ri3 A..p ::ct Met Cys Val L/i Thr Pha H^ Lvs Ts- Thr ''h-
340 345 350

Pro Cys Arg Ser Val Ser Phe Ser Phe Asp Gly Gln Phe Ile Ala Ala
355 360 365

Ser Ser Phe Glu Ser Thr Ile Glu Ile Phe His Ile Glu Ser Ser Gln
370 375 380

Pro Ile His Thr Ile Glu Cys Gly Val Ser Ser Leu Met Trp His Pro
385 390 395 400

Thr Leu Pro Leu Leu Ala Tyr Ala Pro Glu Ser Ile Asn Glu Asn Asn
405 410 415

Lys Asp Pro Ser Ile Arg Val Phe Gly Tyr His Ser
420 425

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 517 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: BETA TRCP, Fig. 13

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Glu Gly Phe Ser Cys Ser Leu Gln Pro Pro Thr Ala Ser Glu Arg
1 5 10 15

Glu Asp Cys Asn Arg Asp Glu Pro Pro Arg Lys Ile Ile Thr Glu Lys
20 25 30

Asn Thr Leu Arg Gln Thr Lys Leu Ala Asn Gly Thr Ser Ser Met Ile
35 40 45

Val Pro Lys Gln Arn Lys Leu Ser XIa Asn Tyr Glu Lys Glu Lys Glu
50 55 60

Leu Cys Val Lys Tyr Phe Glu Gln Trp Ser Glu Cys Asp Gln Val Glu
65 70 75 80

Phe Val Glu His Leu Ile Ser Arg Met Cys His Tyr Gln His Gly His
85 90 95

Ile Asn Thr Tyr Leu Lys Pro Met Leu Gln Arg Asp Phe Ile Thr Ala
100 105 110

Leu Pro Ala Arg Gly Leu Asp His Ile Ala Glu Asn Ile Leu Ser Tyr
115 120 125

Leu Asp Ala Lys Ser Leu Cys Ser Ala Glu Leu Val Cys Lys Glu Trp
130 135 140

Tyr Arg Val Thr Ser Asp Gly Met Leu Trp Lys Lys Leu Ile Glu Arg
145 150 155 160

Met Val Arg Thr Asp Ser Leu Trp Arg Gly Leu Ala Glu Arg Arg Gly
165 170 175

Trp Gly Gln Tyr Leu Phe Lys Asn Lys Pro Pro Asp Gly Lys Thr Pro
180 185 190

Pro Asn Ser Phe Tyr Arg Ala Leu Tyr Pro Lys Ile Ile Gln Asp Ile
195 200 205

Glu Thr Ile Glu Ser Asn Trp Arg Cys Gly Arg His Ser Leu Gln Arg
210 215 220

Ile His Cys Arg Ser Glu Thr Ser Lys Gly Val Tyr Cys Leu Gln Tyr
225 230 235 240

Asp Asp Gln Lys Ile Val Ser Gly Leu Arg Asp Asn Thr Ile Lys Ile
245 250 255

Trp Asp Lys Asn Thr Leu Glu Cys Lys Arg Val Leu Met Gly His Thr
260 265 270

Gly Ser Val Leu Cys Leu Gln Tyr Asp Glu Arg Val Ile Ile Thr Gly
275 280 285

Ser Asp Ser Thr Val Arg Val Trp Asp Val Asn Thr Gly Glu Met Leu
290 295 300

Asn Thr Leu Ile His His Cys Glu Ala Val Leu His Leu Arg Phe Asn
305 310 315 320

Asn Gly Met Met Val Thr Cys Ser Lys Asp Arg Ser Ile Ala Val Trp
325 330 335

Asp Met Ala Ser Ala Thr Asp Ile Thr Leu Arg Arg Val Leu Val Gly
340 345 350

His Arg Ala Ala Val Asn Val Val Asp Phe Asp Asp Lys Tyr Ile Val
355 360 365

Ser Ala Ser Gly Asp Arg Thr Ile Lys Val Trp Asn Thr Ser Thr Cys
370 375 380

Glu Phe Val Arg Thr Leu Asn Gly His Lys Arg Gly Ile Ala Cys Leu
385 390 395 400

Gln Tyr Arg Asp Arg Leu Val Val Ser Gly Ser Ser Asp Asn Thr Ile
405 410 415

Arg Leu Trp Asp Ile Glu Cys Gly Ala Cys Leu Arg Val Leu Glu Gly
420 425 430

His Glu Glu Leu Val Arg Cys Ile Arg Phe Asp Asn Lys Arg Ile Val
435 440 445

Ser Gly Ala Tyr Asp Gly Lys Ile Lys Val Trp Asp Leu Val Ala Ala
450 455 460

Leu Asp Pro Arg Ala Pro Ala Gly Thr Leu Cys Leu Arg Thr Leu Val
465 470 475 480

Glu His Ser Gly Arg Val Phe Arg Leu Gln Phe Asp Glu Phe Gln Ile
485 490 495

Val Ser Ser Ser His Asp Asp Thr Ile Leu Ile Trp Asp Phe Leu Asn
500 505 510

Asp Pro Gly Leu Ala
515

(2) INFORMATION FOR SEQ ID NO:31: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 906 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: beta-prime-cop, Fig. 14

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

Met Pro Leu Arg Leu Asp Ile Lys Arg Lys Leu Thr Ala Arg Ser Asp
1 5 10 15

Arg Val Lys Ser Val Asp Leu His Pro Thr Glu Pro Trp Met Leu Ala
20 25 30

Ser Leu Tyr Asn Gly Ser Val Cys Val Trp Asn His Glu Thr Gln Thr
35 40 45

Leu Val Lys Thr Phe Glu Val Cys Asp Leu Pro Val Arg Ala Ala Lys
50 55 60

Phe Val Ala Arg Lys Asn Trp Val Val Thr Gly Ala Asp Asp Met Gln
65 70 75 80

Ile Arg Val Phe Asn Tyr Asn Thr Leu Glu Arg Val His Met Phe Glu
85 90 95

Ala His Ser Asp Tyr Ile Arg Cys Ile Ala Val His Pro Thr Gln Pro
100 105 110

Phe Ile Leu Thr Ser Ser Asp Asp Met Leu Ile Lys Leu Trp Asp Trp
115 120 125

Asp Lys Lys Trp Ser Cys Ser Gln Val Phe Glu Gly His Thr His Tyr
130 135 140

Val Met Gln Ile Val Ile Asn Pro Lys Asp Asn Asn Gln Phe Ala Ser
145 150 155 160

.Ma 32r L2U Asp Arg Thr Ila Lys Vtl "".n J.au Gly Ser Ser Ser
165 170 175

Pro Asn Phe Thr Leu Glu Gly His Glu Lys Gly Val Asn Cys Ile Asp
180 185 190

Tyr Tyr Ser Gly Gly Asp Lys Pro Tyr Leu Ile Ser Gly Ala Asp Asp
195 200 205

Arg Leu Val Lys Ile Trp Asp Tyr Gln Asn Lys Thr Cys Val Gln Thr
210 215 220

Leu Glu Gly His Ala Gln Asn Val Ser Cys Ala Ser Phe His Pro Glu
225 230 235 240

Leu Pro Ile Ile Ile Thr Gly Ser Glu Asp Gly Thr Val Arg Ile Trp
245 250 255

His Ser Ser Thr Tyr Arg Leu Glu Ser Thr Leu Asn Tyr Gly Met Glu
260 265 270

Arg Val Trp Cys Val Ala Ser Leu Arg Gly Ser Asn Asn Val Ala Leu
275 280 285

Gly Tyr Asp Glu Gly Ser Ile Ile Val Lys Leu Gly Arg Glu Glu Pro
290 295 300

Ala Met Ser Met Asp Ala Asn Gly Lys Ile Ile Trp Ala Lys His Ser
305 310 315 320

Glu Val Gln Gln Ala Asn Leu Lys Ala Met Gly Asp Ala Glu Ile Lys
325 330 335

Asp Gly Glu Arg Leu Pro Leu Ala Val Lys Asp Met Gly Ser Cys Glu
340 345 350

Ile Tyr Pro Gln Thr Ile Gln His Asn Pro Asn Gly Arg Phe Val Val
355 360 365

Val Cys Gly Asp Gly Glu Tyr Ile Ile Tyr Thr Ala Met Ala Leu Arg
370 375 380

Asn Lys Ser Phe Gly Ser Ala Gln Glu Phe Ala Trp Ala His Asp Ser
385 390 395 400

Ser Glu Tyr Ala Ile Arg Glu Ser Asn Ser Val Val Lys Ile Phe Lys
405 410 415

Asn Phe Lys Glu Lys Lys Ser Phe Lys Pro Asp Phe Gly Ala Glu Ser
420 425 430

Ile Tyr Gly Gly Phe Leu Leu Gly Val Arg Ser Val Asn Gly Leu Ala
435 440 445

Phe Tyr Asp Trp Glu Asn Thr Glu Leu Ile Arg Arg Ile Glu Ile Gln
450 455 460

Pro Lys His Ile Phe Trp Ser Asp Ser Gly Glu Leu Val Cys Ile Ala
465 470 475 480

Thr Glu Glu Ser Phe Phe Ile Leu Lys Tyr Leu Ser Glu Lys Val Leu
485 490 495

Ala Ala Gln Glu Thr His Glu Gly Val Thr Glu Asp Gly Ile Glu Asp
500 505 510

Gly Phe Glu Val Leu Gly Glu Ile Gln Glu Ile Val Lys Thr Gly Leu
515 520 525

Trp Val Gly Asp Cys Phe Ile Tyr Thr Ser Ser Val Asn Arg Leu Asn
530 535 540

Tyr Tyr Val Gly Gly Glu Ile Val Thr Ile Ala His Leu Asp Arg Thr
545 550 555 560

Met Tyr Leu Leu Gly Tyr Ile Pro Lys Asp Asn Arg Leu Tyr Leu Gly
565 570 575

Asp Lys Glu Leu Asn Ile Val Ser Tyr Ser Leu Leu Val Ser Val Leu
580 585 590

Glu Tyr Gln Thr Ala Val Met Arg Arg Asp Phe Ser Met Ala Asp Lys
595 600 605

Val Leu Pro Thr Ile Pro Lys Glu Gln Arg Thr Arg Val Ala His Phe
610 615 620

Leu Glu Lys Gln Gly Phe Lys Gln Gln Ala Leu Thr Val Ser Thr Asp
625 630 635 640

Pro Glu His Arg Phe Glu Leu Ala Leu Gln Leu Gly Glu Leu Lys Ile
645 650 655

kla. Cyr Gin L3U Ala Val Glu Ala G\u '3<ir Glu Gin Lys Trp Lys Gin
660 665 670

Leu Ala Glu Leu Ala Ile Ser Lys Cys Pro Phe Gly Leu Ala Gln Glu
675 680 685

Cys Leu His His Ala Gln Asp Tyr Gly Gly Leu Leu Leu Leu Ala Thr
690 695 700

Ala Ser Gly Asn Ala Ser Met Val Asn Lys Leu Ala Glu Gly Ala Glu
705 710 715 720

Arg Asp Gly Lys Asn Asn Val Ala Phe Met Ser Tyr Phe Leu Gln Gly
725 730 735

Lys Leu Asp Ala Cys Leu Glu Leu Leu Ile Arg Thr Gly Arg Leu Pro
740 745 750

Glu Ala Ala Phe Leu Ala Arg Thr Tyr Leu Pro Ser Gln Val Ser Arg
755 760 765

Val Val Lys Leu Trp Arg Glu Asn Leu Ser Lys Val Asn Gln Lys Ala
770 775 780

Ala Glu Ser Leu Ala Asp Pro Thr Glu Tyr Glu Asn Leu Phe Pro Gly
785 790 795 800

Leu Lys Glu Ala Phe Val Val Glu Glu Trp Val Lys Glu Thr His Ala
805 810 815

Asp Leu Trp Pro Ala Lys Gln Tyr Pro Leu Val Thr Pro Asn Glu Glu
820 825 830

Arg Asn Val Met Glu Glu Ala Lys Gly Phe Gln Pro Ser Arg Ser Ala
835 840 845

Ala Gln Gln Glu Leu Asp Gly Lys Pro Ala Ser Pro Thr Pro Val Ile
850 855 860

Val Thr Ser Gln Thr Ala Asn Lys Glu Glu Lys Ser Leu Leu Glu Leu
865 870 875 880

Glu Val Asp Leu Asp Asn Leu Glu Ile Glu Asp Ile Asp Thr Thr Asp
885 890 895

Ile Asa Leu Asp Glu Asp Ile Leu Asp Asp
900 905



(2) INFORMATION FOR SEQ ID NO:32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 779 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CDC4 / CDC20 protein, Fig. 15

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:

Met Gly Ser Phe Pro Leu Ala Glu Phe Pro Leu Arg Asp Ile Pro Val
1 5 10 15

Pro Tyr Ser Tyr Arg Val Ser Gly Gly Ile Ala Ser Ser Gly Ser Val
20 25 30

Thr Ala Leu Val Thr Ala Ala Gly Thr His Arg Asn Ser Ser Thr Ala
35 40 45

Lys Thr Val Glu Thr Glu Asp Gly Glu Glu Asp Ile Asp Glu Tyr Gln
50 55 60

Arg Lys Arg Ala Ala Gly Ser Gly Glu Ser Thr Pro Glu Arg Ser Asp
65 70 75 80

Phe Lys Arg Val Lys His Asp Asn His Lys Thr Leu His Pro Val Asn
85 90 95

Leu Gln Asn Thr Gly Ala Ala Ser Val Asp Asn Asp Gly Leu His Asn
100 105 110

Leu Thr Asp Ile Ser Asn Asp Ala Glu Lys Leu Leu Met Ser Val Asp
115 120 125

Asp Gly Ser Ala Ala Pro Ser Thr Leu Ser Val Asn Met Gly Val Ala
130 135 140

Ser His Asn Val Ala Ala Pro Thr Thr Val Asn Ala Ala Thr Il-u t: r
145 150 155 160

Gly Ser Asp Val Ser Asn Asn Val Asn Ser Ala Thr Ile Asn Asn Pro
165 170 175

Met Glu Glu Gly Ala Leu Pro Leu Ser Pro Thr Ala Ser Ser Pro Gly
180 185 190

Thr Thr Thr Pro Leu Ala Lys Thr Thr Lys Thr Ile Asn Asn Asn Asn
195 200 205

Asn Ile Ala Asp Leu Ile Glu Ser Lys Asp Ser Ile Ile Ser Pro Glu
210 215 220

Tyr Leu Ser Asp Glu Ile Phe Ser Ala Ile Asn Asn Asn Leu Pro His
225 230 235 240

Ala Tyr Phe Lys Asn Leu Leu Phe Arg Leu Val Ala Asn Met Asp Arg
245 250 255

Ser Glu Leu Ser Asp Leu Gly Thr Leu Ile Lys Asp Asn Leu Lys Arg
260 265 270

Asp Leu Ile Thr Ser Leu Pro Phe Glu Ile Ser Leu Lys Ile Phe Asn
275 280 285

Tyr Leu Gln Phe Glu Asp Ile Ile Asn Ser Leu Gly Val Ser Gln Asn
290 295 300

Trp Asn Lys Ile Ile Arg Lys Ser Thr Ser Leu Trp Lys Lys Leu Leu
305 310 315 320

Ile Ser Glu Asn Phe Val Ser Pro Lys Gly Phe Asn Ser Leu Asn Leu
325 330 335

Lys Leu Ser Gln Lys Tyr Pro Lys Leu Ser Gln Gln Asp Arg Leu Arg
340 345 350

Leu Ser Phe Leu Glu Asn Ile Phe Ile Leu Lys Asn Trp Tyr Asn Pro
355 360 365

Lys Phe Val Pro Gln Arg Thr Thr Leu Arg Gly His Met Thr Ser Val
370 375 380

11-3 Thr C/j Lz-j. Jin S'.-:'- Clu .Mp ryr Val II- Thr Gly Ala Asp
385 390 395 400

Asp Lys Met Ile Arg Val Tyr Asp Ser Ile Asn Lys Lys Phe Leu Leu
405 410 415

Gln Leu Ser Gly His Asp Gly Gly Val Trp Ala Leu Lys Tyr Ala His
420 425 430

Gly Gly Ile Leu Val Ser Gly Ser Thr Asp Arg Thr Val Arg Val Trp
435 440 445

Asp Ile Lys Lys Gly Cys Cys Thr His Val Phe Glu Gly His Asn Ser
450 455 460

Thr Val Arg Cys Leu Asp Ile Val Glu Tyr Lys Asn Ile Lys Tyr Ile
465 470 475 480

Val Thr Gly Ser Arg Asp Asn Thr Leu His Val Trp Lys Leu Pro Lys
485 490 495

Glu Ser Ser Val Pro Asp His Gly Glu Glu His Asp Tyr Pro Leu Val
500 505 510

Phe His Thr Pro Glu Glu Asn Pro Tyr Phe Val Gly Val Leu Arg Gly
515 520 525

His Met Ala Ser Val Arg Thr Val Ser Gly His Gly Asn Ile Val Val
530 535 540

Ser Gly Ser Tyr Asp Asn Thr Leu Ile Val Trp Asp Val Ala Gln Met
545 550 555 560

Lys Cys Leu Tyr Ile Leu Ser Gly His Thr Asp Arg Ile Tyr Ser Thr
565 570 575

Ile Tyr Asp His Glu Arg Lys Arg Cys Ile Ser Ala Ser Met Asp Thr
580 585 590

Thr Ile Arg Ile Trp Asp Leu Glu Asn Ile Trp Asn Asn Gly Glu Cys
595 600 605

Ser Tyr Ala Thr Asn Ser Ala Ser Pro Cys Ala Lys Ile Leu Gly Ala
610 615 620

Met Tyr Thr Leu Gln Gly His Thr Ala Leu Val Gly Leu Leu Arg Leu
625 630 635 640

Ser Asp Lys Phe Leu Val Ser Ala Ala Ala Asp Gly Ser Ile Arg . . •
645 650 655

Trp Asp Ala Asn Asp Tyr Ser Arg Lys Phe Ser Tyr His His Thr Asn
660 665 670

Leu Ser Ala Ile Thr Thr Phe Tyr Val Ser Asp Asn Ile Leu Val Ser
675 680 685

Gly Ser Glu Asn Gln Phe Asn Ile Tyr Asn Leu Arg Ser Gly Lys Leu
690 695 700

Val His Ala Asn Ile Leu Lys Asp Ala Asp Gln Ile Trp Ser Val Asn
705 710 715 720

Phe Lys Gly Lys Thr Leu Val Ala Ala Val Glu Lys Asp Gly Gln Ser
725 730 735

Phe Leu Glu Ile Leu Asp Phe Ser Lys Ala Ser Lys Ile Asn Tyr Val
740 745 750

Ser Asn Pro Val Asn Ser Ser Ser Ser Ser Leu Glu Ser Ile Ser Thr
755 760 765

Ser Leu Gly Leu Thr Arg Thr Thr Ile Ile Pro
770 775






(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: GBLP-CHLAMIDOMONAS HOMOLOG, Fig. 16

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:



40 



45 



Met Ala Glu Thr Leu Thr Leu Arg Ala Thr Leu Lys Gly His Thr Asn
1 5 10 15

Trp Val Thr Ala Ile Ala Thr Pro Leu Asp Pro Ser Ser Asn Thr Leu
20 25 30

Leu Ser Ala Ser Arg Asp Lys Ser Val Leu Val Trp Glu Leu Glu Arg
35 40 45

Ser Glu Ser Asn Tyr Gly Tyr Ala Arg Lys Ala Leu Arg Gly His Ser
50 55 60

His Phe Val Gln Asp Val Val Ile Ser Ser Asp Gly Gln Phe Cys Leu
65 70 75 80

Thr Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp Leu Asn Thr Gly
85 90 95

Thr Thr Thr Arg Arg Phe Val Gly His Thr Lys Asp Val Leu Ser Val
100 105 110

Ala Phe Ser Val Asp Asn Arg Gln Ile Val Ser Gly Ser Arg Asp Lys
115 120 125

Thr Ile Lys Leu Trp Asn Thr Leu Gly Glu Cys Lys Tyr Thr Ile Gly
130 135 140

Glu Pro Glu Gly His Thr Glu Trp Val Ser Cys Val Arg Phe Ser Pro
145 150 155 160

Met Thr Thr Asn Pro Ile Ile Val Ser Gly Gly Trp Asp Lys Met Val
165 170 175

Lys Val Trp Asn Leu Thr Asn Cys Lys Leu Lys Asn Asn Leu Val Gly
180 185 190

His His Gly Tyr Val Asn Thr Val Thr Val Ser Pro Asp Gly Ser Leu
195 200 205

Cys Ala Ser Gly Gly Lys Asp Gly Ile Ala Met Leu Trp Asp Leu Ala
210 215 220

Glu Gly Lys Arg Leu Tyr Ser Leu Asp Ala Gly Asp Val Ile His Cys
225 230 235 240

Cys :-r., 3er :>ro Asn Arg Tyr Trp L^u Cys Ala A- a Thr Gin S-^
245 250 255

Ser Ile Lys Ile Trp Asp Leu Glu Ser Lys Ser Ile Val Asp Asp Leu
260 265 270

Arg Pro Glu Phe Asn Ile Thr Ser Lys Lys Ala Gln Val Pro Tyr Cys
275 280 285

Val Ser Leu Ala Trp Ser Ala Asp Gly Ser Thr Leu Tyr Ser Gly Tyr
290 295 300

Thr Asp Gly Gln Ile Arg Val Trp Ala Val Gly His Ser Leu
305 310 315

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 658 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: cop-1 protein, Fig. 17



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Qlu Glu He Ser Thr Asp Pro Val Val Pro Ala Val Lys Pro Asp 



10 



IS 



Pro Arg Thr Ser Ser Val Gly Glu Gly Ala Asn Arg His Glu Asn Asp 



30 20 25 



30 



Asp Gly Gly Ser Gly Gly Ser Glu He Gly Ala Pro Asp Leu Asp Lys 
35 40 45 

Asp Leu Leu Cys Pro He Cys Met Gin He He Lys Asp Ala Phe Leu 

55 60 

Thr Ala C/3 Gly Kis Ser Phe Cys Tyr Ket Cys He He Thr His Leu 



65 70 



SO 



Arg Asn Lys Ser Asp Cys Pro Cys Cys Ser Gin His Leu Thr Asn Asn 
85 90 95 

Gin Leu Tyr Pro Asn Phe Leu Leu Asp Lys Leu Leu Lys Lys Thr Ser 
• 105 110 
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Ala Arg His Val Ser Lys Thr Ala Ser Pro Leu Asp Gin Phe Arg Glu 
1« '120 125 

Ala Leu Gin Arg Gly Cys Asp Val Ser He Lys Glu Val Asp Asn Leu 
5 130 135 140 

Leu Thr Leu Leu Ala Glu Arg Lys Arg Lys Met Glu Gin Glu Glu Ala 
"5 150 155 3^go 

10 Glu Arg Asn Met Gin He Leu Leu Asp Phe Leu His Cys Leu Arg Lys 

IfiS 170 

Gin Lys Val Asp Glu Leu Asn Glu Val Gin Thr Asp Leu Gin Tyr He 
"° 185 190 

15 

Lys Glu Asp lie Asn Ala Val Glu Arg His Arg He Asp Leu Tyr Arg 
195 200 205 

Ala Arg Asp Arg Tyr Ser Val Lys Leu Arg Met Leu Gly Asp Asp Pro 
20 210 215 220 

Ser Thr Arg Asn Ala Trp Pro His Glu Lys Asn Gln Ile Gly Phe Asn
225 230 235 240

Ser Asn Ser Leu Ser Ile Arg Gly Gly Asn Phe Val Gly Asn Tyr Gln
250 255

Asn Lys Lys Val Glu Gly Lys Ala Gln Gly Ser Ser His Gly Leu Pro
260 265 270

Lys Lys Asp Ala Leu Ser Gly Ser Asp Ser Gln Ser Leu Asn Gln Ser
275 280 285

Thr Val Ser Met Ala Arg Lys Lys Arg Ile His Ala Gln Phe Asn Asp
290 295 300

Leu Gln Glu Cys Tyr Leu Gln Lys Arg Arg Gln Leu Ala Asp Gln Pro
305 310 315 320



Asn Ser Lys Gln Glu Asn Asp Lys Ser Val Val Arg Arg Glu Gly Tyr
325 330 335

Ser Asn Gly Leu Ala Asp Phe Gln Ser Val Leu Thr Thr Phe Thr Arg
340 345 350

Tyr Ser Arg Leu Arg Val Ile Ala Glu Ile Arg His Gly Asp Ile Phe
355 360 365

His Ser Ala Asn Ile Val Ser Ser Ile Glu Phe Asp Arg Asp Asp Glu
370 375 380

Leu Phe Ala Thr Ala Gly Val Ser Arg Cys Ile Lys Val Phe Asp Phe
385 390 395 400



Ser Ser Val Val Asn Glu Pro Ala Asp Met Gln Cys Pro Ile Val Glu
405 410 415

Met Ser Thr Arg Ser Lys Leu Ser Cys Leu Ser Trp Asn Lys His Glu
420 425 430

Lys Asn His Ile Ala Ser Ser Asp Tyr Glu Gly Ile Val Thr Val Trp
435 440 445

Asp Val Thr Thr Arg Gln Ser Leu Met Glu Thr Glu Glu Asn Glu Lys
450 455 460

Arg Ala Trp Ser Val Asp Phe Ser Arg Thr Glu Pro Ser Met Leu Val
465 470 475 480

Ser Gly Ser Asp Asp Cys Lys Val Lys Val Trp Cys Thr Arg Gln Glu
485 490 495

Ala Ser Val Ile Asn Ile Asp Met Lys Ala Asn Ile Cys Cys Val Lys
500 505 510

Tyr Asn Pro Gly Ser Ser Asn Tyr Ile Ala Val Gly Ser Ala Asp His
515 520 525

His Ile His Tyr Tyr Asp Leu Arg Asn Ile Ser Gln Pro Leu His Val
530 535 540

Phe Ser Gly His Lys Lys Ala Val Ser Tyr Met Lys Phe Leu Ser Asn
545 550 555 560



Asn Glu -^u Ala Ser Ala Z^r Thr Asp Ser Thr Z.iu Arg Leu Trp Asp
565 570 575

Val Lys Asp Asn Leu Pro Val Arg Thr Phe Arg Gly His Thr Asn Glu
580 585 590

Lys Asn Phe Val Gly Leu Thr Val Asn Ser Glu Tyr Leu Ala Cys Gly
595 600 605

Ser Glu Thr Thr Arg Tyr Val Tyr His Lys Glu Ile Thr Arg Pro Val
615 620

Thr Ser His Arg Phe Gly Ser Pro Asp Met Asp Asp Ala Glu Lys Arg
630 635 640

Gln Val Pro Thr Leu Leu Val Arg Phe Ala Gly Arg Val Ile Val Pro
650

Arg Cys



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 440 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CORO PROTEIN, Fig. 18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

Met Ser Lys Val Val Arg Ser Ser Lys Tyr Arg His Val Phe Ala Ala
10 15

Gln Pro Lys Lys Glu Glu Cys Tyr Gln Asn Leu Lys Thr Lys Ser Ala
20 25 30

Val Trp Asp Ser Asn Tyr Val Ala Ala Asn Thr Arg Tyr Ile Trp Asp
40 45

Ala Ala Gly Gly Gly Ser Phe Ala Val Glu Ala Ile Pro His Ser Gl^-
50 55 60

Lys Thr Thr Ser Val Pro Leu Phe Asn Gly His Lys Ser Ala Val Leu
70 75 80

Asp Ile Ala Phe His Pro Kie Aca Glu Asn Leu Val Gly Ser Val Ser

Glu Asp Cys Asn Ile Cys Ile Trp Gly Ile Pro Glu Gly Gly Leu Thr
100 105

Asp Ser Ile Ser Thr Pro Leu Gln Thr Leu Ser Gly His Lys Arg Lys
120 125



Val Gly Thr Ile Ser Phe Gly Pro Val Ala Asp Asn Val Ala Val Thr
130 135 140

Ser Ser Gly Asp Phe Leu Val Lys Thr Trp Asp Val Glu Gln Gly Lys
150 155 160

Asn Leu Thr Thr Val Glu Gly His Ser Asp Met Ile Thr Ser Cys Glu
165 170 175

His Asn Gly Ser Gln Ile Val Thr Thr Cys Lys Asp Lys Lys Ala Arg
180 185 190

Val Phe Asp Pro Arg Thr Asn Ser Ile Val Asn Glu Val Val Cys His
195 200 205

Gln Gly Val Lys Asn Ser Arg Ala Ile Phe Ala Lys Asp Lys Val Ile
210 215 220

Thr Val Gly Phe Ser Lys Thr Ser Glu Arg Glu Leu His Ile Tyr Asp
230 235 240

Pro Arg Ala Phe Thr Thr Pro Leu Ser Ala Gln Val Val Asp Ser Ala
245 250 255

Ser Gly Leu Leu Met Pro Phe Tyr Asp Ala Asp Asn Ser Ile Leu Tyr
260 265 270

Leu Ala Gly Lys Gly Asp Gly Asn Ile Arg Tyr Tyr Glu Leu Val Asp
280 285



Glu Ser Pro Tyr Ile His Phe Leu Ser Glu Phe Lys Ser Ala Thr
290 295 300

Gln Arg Gly Leu Cys Phe Leu Pro Lys Arg Cys Leu Asn Thr Ser Glu
310 315 320

Cys Glu Ile Ala Arg Gly Leu Lys Val Thr Pro Phe Thr Val Glu Pro
325 330 335

Ile Ser Phe Arg Val Pro Arg Lys Ser Asp Ile Phe Gln Gly Asp Ile
340 345 350

Tyr Pro Asp Thr Tyr Ala Gly Glu Pro Ser Leu Thr Ala Glu Gln Trp
355 360 365

Val Ser Gly Thr Asn Ala Glu Pro Lys Thr Val Ser Leu Ala Gly Gly
370 375 380

Phe Val Lys Lys Ala Ser Ala Val Glu Phe Lys Pro Val Val Gln Val
390 395 400

Gln Glu Gly Pro Lys Asn Glu Lys Glu Leu Arg Glu Glu Tyr Glu Lys
405 410

Leu Lys Ile Arg Val Ala Tyr Leu Glu Ser Glu Ile Val Lys Lys Asp
420 425 430

Ala Lys Ile Lys Glu Leu Thr Asn
435 440



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 445 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Coronin (p55), Fig. 19

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

Met Ser Lys Val Val Arg Ser Ser Lys Tyr Arg His Val Phe Ala Ala
1 5 10 15

Gln Pro Lys Lys Glu Glu Cys Tyr Gln Asn Leu Lys Val Thr Lys Ser
20 25 30

Ala Trp Asp Ser Asn Tyr Val Ala Ala Asn Thr Arg Tyr Phe Gly Val
35 40 45

Ile Trp Asp Ala Ala Gly Gly Gly Ser Phe Ala Val Ile Pro His Glu
50 55 60

Ala Ser Gly Lys Thr Thr Ser Val Pro Leu Phe Asn Gly His Lys Ser
65 70 75 80

Ala Val Leu Asp Ile Ala Phe His Pro Phe Asn Glu Asn Leu Val Gly
85 90 95

Ser Val Ser Glu Asp Cys Asn Ile Cys Ile Trp Gly Ile Pro Glu Gly
100 105 110

Gly Leu Thr Asp Ser Ile Ser Thr Pro Leu Gln Thr Leu Ser Gly His
115 120 125

Lys Arg Lys Val Gly Thr Ile Ser Phe Gly Pro Val Ala Asp Asn Val
130 135 140



Ala Val Thr Ser Ser Gly Asp Phe Leu Val Lys Thr Trp Asp Val Glu
145 150 155 160

Gln Gly Lys Asn Leu Thr Thr Val Glu Gly His Ser Asp Met Ile Thr
165 170 175

Ser Cys Glu Trp Asn His Asn Gly Ser Gln Ile Val Thr Thr Cys Lys
180 185 190

Asp Lys Lys Ala Arg Val Phe Asp Pro Arg Thr Asn Ser Ile Val Asn
195 200 205

Glu Val Val Cys His Gln Gly Val Lys Asn Ser Arg Ala Ile Phe Ala
210 215 220

Lys Asp Lys Val 11^ riir :iv iFhs Ser Lys Thr Ser Glu Arg Glu
225 230 235 240

Leu His Ile Tyr Asp Pro Arg Ala Phe Thr Thr Pro Leu Ser Ala Gln
245 250 255



Val Val Asp Ser Ala Ser Gly Leu Leu Met Pro Phe Tyr Asp Ala Asp
260 265 270

Asn Ser Ile Leu Tyr Leu Ala Gly Lys Gly Asp Gly Asn Ile Arg Tyr
275 280 285

Tyr Glu Leu Val Asp Glu Ser Pro Tyr Ile His Phe Leu Ser Glu Phe
290 295 300

Lys Ser Ala Thr Pro Gln Arg Gly Leu Cys Phe Leu Pro Lys Arg Cys
305 310 315 320

Leu Asn Thr Ser Glu Cys Glu Ile Ala Arg Gly Leu Lys Val Thr Pro
325 330 335

Phe Thr Val Glu Pro Ile Ser Phe Arg Val Pro Arg Lys Ser Asp Ile
340 345 350

Phe Gln Gly Asp Ile Tyr Pro Asp Thr Tyr Ala Gly Glu Pro Ser Leu
355 360 365

Thr Ala Glu Gln Trp Val Ser Gly Thr Asn Ala Glu Pro Lys Thr Val
370 375 380

Ser Leu Ala Gly Gly Phe Val Lys Lys Ala Ser Ala Val Glu Phe Lys
385 390 395 400

Pro Val Val Gln Val Gln Glu Gly Pro Lys Asn Glu Lys Glu Leu Arg
405 410 415

Glu Glu Tyr Glu Lys Leu Lys Ile Arg Val Ala Tyr Leu Glu Ser Glu
420 425 430

Ile Val Lys Lys Asp Ala Lys Ile Lys Glu Leu Thr Asn
435 440 445

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 431 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CS-T 50kDa, Fig. 20

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:

Met Tyr Arg Thr Lys Val Gly Leu Lys Asp Arg Gln Gln Leu Tyr Lys
1 5 10 15

Leu Ile Ile Ser Gln Leu Leu Tyr Asp Gly Tyr Ile Ser Ile Ala Asn
20 25 30

Gly Leu Ile Asn Glu Ile Lys Pro Gln Ser Val Cys Ala Pro Ser Glu
35 40 45

Gln Leu Leu His Leu Ile Lys Leu Gly Met Glu Asn Asp Asp Thr Ala
50 55 60

Val Gln Tyr Ala Ile Gly Arg Ser Asp Thr Val Ala Pro Gly Thr Gly
65 70 75 80

Ile Asp Leu Glu Phe Asp Ala Asp Val Gln Thr Met Ser Pro Glu Ala
85 90 95

Ser Glu Tyr Glu Thr Cys Tyr Val Thr Ser His Lys Gly Pro Cys Arg
100 105 110

Val Ala Thr Tyr Ser Arg Asp Gly Gln Leu Ile Ala Thr Gly Ser Ala
115 120 125

Asp Ala Ser Ile Lys Ile Leu Asp Thr Glu Arg Met Leu Ala Lys Ser
130 135 140

Ala Met Pro Ile Glu Val Met Met Asn Glu Thr Ala Gln Gln Asn Met
145 150 155 160

Glu Asn His Pro Val Ile Arg Thr Leu Tyr Asp His Val Asp Glu Val
165 170 175

Thr Cys Leu Ala Phe His Pro Thr Glu Gln Ile Leu Ala Ser Gly •;
180 185 190

Arg Asp Tyr Thr Leu Lys Leu Phe Asp Tyr Ser Lys Pro Ser Ala Lys
195 200 205



Arg Ala Phe Lys Tyr Ile Gln Glu Ala Glu Met Leu Arg Ser Ile Ser
210 215 220

Phe His Pro Ser Gly Asp Phe Ile Leu Val Gly Thr Gln His Pro Thr
225 230 235 240

Leu Arg Leu Tyr Asp Ile Asn Thr Phe Gln Cys Phe Val Ser Cys Asn
245 250 255

Pro Gln Asp Gln His Thr Asp Ala Ile Cys Ser Val Asn Tyr Asn Ser
260 265 270

Ser Ala Asn Met Tyr Val Thr Gly Ser Lys Asp Gly Cys Ile Lys Leu
275 280 285

Trp Asp Gly Val Ser Asn Arg Cys Ile Thr Thr Phe Glu Lys Ala His
290 295 300

Asp Gly Ala Glu Val Cys Ser Ala Ile Phe Ser Lys Asn Ser Lys Tyr
305 310 315 320

Ile Leu Ser Ser Gly Lys Asp Ser Val Ala Lys Leu Trp Glu Ile Ser
325 330 335

Thr Gly Arg Thr Leu Val Arg Tyr Thr Gly Ala Gly Leu Ser Gly Arg
340 345 350

Gln Val His Arg Thr Gln Ala Val Phe Asn His Thr Glu Asp Tyr Val
355 360 365

Leu Leu Pro Asp Glu Arg Thr Ile Ser Leu Cys Cys Trp Asp Ser Arg
370 375 380

Thr Ala Glu Arg Arg Asn Leu Leu Ser Leu Gly His Asn Asn Ile Val
385 390 395 400

Arg Cys Ile Val His Ser Pro Thr Asn Pro Gly Phe Met Thr Cys Ser
405 410 415

Asp .\-^j> Xr- Aln \vj Fh-i Trp Tyr Arrr At? r. Thr Thr Asp 
40 420 425 430 

(2) INFORMATION FOR SEQ ID NO: 38: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 340 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine, Fig. 21

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

Met Ser Glu Leu Asp Gln Leu Arg Gln Glu Ala Glu Gln Leu Lys Asn
1 5 10 15

Gln Ile Arg Asp Ala Arg Lys Ala Cys Ala Asp Ala Thr Leu Ser Gln
20 25 30

Ile Thr Asn Asn Ile Asp Pro Val Gly Arg Ile Gln Met Arg Thr Arg
35 40 45

Arg Thr Leu Arg Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Gly
50 55 60

Thr Asp Ser Arg Leu Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile
65 70 75 80

Ile Trp Asp Ser Tyr Thr Thr Asn Lys Val His Ala Ile Pro Leu Arg
85 90 95

Ser Ser Trp Val Met Thr Cys Ala Tyr Ala Pro Ser Gly Asn Tyr Val
105 110

Ala Cys Gly Gly Leu Asp Asn Ile Cys Ser Ile Tyr Asn Leu Lys Thr
115 120 125

Arg Glu Gly Asn Val Arg Val Ser Arg Glu Leu Ala Gly :'i3 T^- r.^y
130 135 140

Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gln Ile Val Thr
150 155 160



Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp Ile Glu Thr Gly Gln Gln
165 170 175

Thr Thr Phe Thr Gly His Thr Gly Asp Val Met Ser Leu Ser Leu
180 185

Ala Pro Asp Thr Arg Leu Phe Val Ser Gly Ala Cys Asp Ala Ser Ala
195 200 205

Lys Leu Trp Asp Val Arg Glu Gly Met Cys Arg Gln Thr Phe Thr Gly
210 215 220

His Glu Ser Asp Ile Asn Ala Ile Cys Phe Phe Pro Asn Gly Asn Ala
230 235 240

Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp Leu Arg
245 250 255

Ala Asp Gln Glu Leu Met Thr Tyr Ser His Asp Asn Ile Ile Cys Gly
265 270

Ile Thr Ser Val Ser Phe Ser Lys Ser Gly Arg Leu Leu Leu Ala Gly
280 285

Tyr Asp Asp Phe Asn Cys Asn Val Trp Asp Ala Leu Lys Ala Asp Arg
295

Ala Gly Val Leu Ala Gly His Asp Asn Arg Val Ser Cys Leu Gly Val
320

Thr Asp Asp Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu
325 330 335

Lys Ile Trp Asn
340

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 326 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta- bovine (2), Fig. 22

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

Arg Asn Gln Ile Arg Asp Ala Arg Lys Ala Cys Gly Asp Ser Thr Leu
10 15

Thr Gln Ile Thr Ala Gly Leu Asp Pro Val Gly Arg Ile Gln Met Arg
20 25 30

Thr Arg Arg Thr Leu Arg Gly His Leu Ala Lys Ile Tyr Ala Met His
35 40 45

Trp Gly Thr Asp Ser Arg Leu Leu Val Ser Ala Ser Gln Asp Gly Lys
50 55 60

Leu Ile Ile Trp Asp Ser Glu Gly Asn Val Arg Tyr Thr Thr Asn Lys
65 70 75 80

Val His Ala Ile Pro Leu Arg Ser Ser Trp Val Met Thr Cys Ala Tyr
85 90 95

Ala Pro Ser Gly Asn Phe Val Ala Cys Gly Gly Leu Asp Asn Ile Cys
100 105 110

Ser Ile Tyr Ser Leu Lys Thr Arg Val Ser Arg Glu Leu Pro Gly His
115 120 125

Thr Gly Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gln Ile Ile
130 135 140

35 145 ^^"^ ""^^ '''''' ""^^ "^"^ Sly 



"° 155 160 



Gln Gln Thr Val Gly Phe Ala Gly His Ser Gly Asp Val Met Ser Leu
165 170

Ser Leu Ala Pro Asp Gly Arg Thr Phe Val Ser Gly Ala Cys Asp ..... .
180 185 190

Ser Ile Lys Leu Trp Asp Val Arg Asp Ser Met Cys Arg Gln Thr Phe
200 205

Ile Gly His Glu Ser Asp Ile Asn Ala Val Ala Phe Phe Pro Asn Gly
210 215 220

Tyr Ala Phe Thr Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp
225 230 235 240

Leu Arg Ala Asp Gln Glu Leu Leu Met Tyr Ser His Asp Asn Ile Ile
245 250 255

Cys Gly Ile Thr Ser Val Ala Phe Ser Arg Ser Gly Arg Leu Leu Leu
260 265 270

Ala Gly Tyr Asp Asp Phe Asn Cys Asn Ile Trp Asp Ala Met Lys Gly
275 280 285

Asp Arg Ala Gly Val Leu Ala Gly His Asp Asn Arg Val Ser Cys Leu
290 295 300

Gly Val Thr Asp Asp Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser
310 315 320

Phe Leu Lys Ile Trp Asn
325

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 340 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA DROSOPH, Fig. 23

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

Met Asn Glu Leu Asp Ser Leu Arg Gln Glu Ala Glu Ser Leu Lys Asn
1 5 10 15

Ala Ile Arg Asp Ala Arg Lys Ala Ala Cys Asp Thr Ser Leu Leu Gln
20 30

Ala Ala Thr Ser Leu Glu Pro Ile Gly Arg Ile Gln Met Arg Thr Arg
35 40 45

Arg Thr Leu Arg Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Gly
50 55 60

Asn Asp Ser Arg Asn Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile
65 70 75 80

Val Trp Asp Ser His Thr Thr Asn Lys Val His Ala Ile Pro Leu Arg
85 90 95

Ser Ser Trp Val Met Thr Cys Ala Tyr Ala Pro Ser Gly Ser Tyr Val
100 105 110

Ala Cys Gly Gly Leu Asp Asn Met Cys Ser Ile Tyr Asn Leu Lys Thr
115 120 125

Arg Glu Gly Asn Val Arg Val Ser Arg Glu Leu Pro Gly His Gly Gly
130 135

Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gln Ile Val Thr Ser
150 155 160

Ser Gly Asp Met Ser Cys Gly Leu Trp Asp Ile Glu Thr Gly Leu Gln
165 170 175



Val Thr Ser Phe Leu Gly His Thr Gly Asp Val Met Ala Leu Ser Leu
180 185 190

Ala Pro Gln Cys Lys Thr Phe Val Ser Gly Ala Cys Asp Ala Ser Ala
195 200 205

Lys Leu Trp Asp Ile Arg Glu Gly Val Cys Lys Gln Thr Phe Pro Gly



^-^ 220 

His Glu Ser Asp Ile Asn Ala Val 7l:a 5h3 Asn Gly Gln Mn
225 230 235

Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp Ile Arg
245 250 255

Ala Asp Gln Glu Leu Ala Met Tyr Ser His Asp Asn Ile Ile Cys
260 265 270

Ile Thr Ser Val Ala Phe Ser Lys Ser Gly Arg Leu Leu Leu Ala Gly
275 280 285

Tyr Asp Asp Phe Asn Cys Asn Val Trp Asp Thr Met Lys Ala Glu Arg
290 295 300

Ser Gly Ile Leu Ala Gly His Asp Asn Arg Val Ser Cys Leu Gly Val
310 315 320

Thr Glu Asn Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu
325 330 335

Arg Val Trp Asn
340

(2) INFORMATION FOR SEQ ID NO: 41: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN, Fig. 24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Met Thr Glu Gln Met Thr Leu Arg Gly Thr Leu Lys Gly His Asn Gly
1 5 10 15

Trp Val Thr Gln Ile Ala Thr Thr Pro Gln Phe Pro Asp Met Ile Leu
20 25 30

Ser Ala Ser Arg Asp Lys Thr Ile Ile Met Trp Lys Leu Thr Arg Aj .
35 40 45

Glu Thr Asn Tyr Gly Ile Pro Gln Arg Ala Leu Arg Gly His Ser His
50 55 60

Phe Val Ser Asp Val Val Ile Ser Ser Asp Gly Gln Phe Ala Leu Ser
65 70 75 80

Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp Leu Thr Thr Gly Thr
85 90 95

Thr Thr Arg Arg Phe Val Gly His Thr Lys Asp Val Leu Ser Val Ala
100 105



Phe Ser Ser Asp Asn Arg Gln Ile Val Ser Gly Ser Arg Asp Lys Thr
115 120 125

Ile Lys Leu Trp Asn Thr Leu Gly Val Cys Lys Tyr Thr Val Gln Asp
130 135 140

Glu Ser His Ser Glu Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser
150 155

Ser Asn Pro Ile Ile Val Ser Cys Gly Trp Asp Lys Leu Val Lys Val
165 170 175

Trp Asn Leu Ala Asn Cys Lys Leu Lys Thr Asn His Ile Gly His Thr
180 185 190

Gly Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser Leu Cys Ala
195 200 205

Ser Gly Gly Lys Asp Gly Gln Ala Met Leu Trp Asp Leu Asn Glu Gly
210 215 220

Lys His Leu Tyr Thr Leu Asp Gly Gly Asp Ile Ile Asn Ala Leu Cys
230 235 240

Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala Thr Gly Pro Ser Ile
250 255

Lys Ile Trp Asp Leu Glu Gly Lys Ile Ile Val Asp Glu Leu Lys Gln
260 265 270

Glu Val Ile Ser Thr Ser Ser Lys Ala Glu Pro Pro Gln Cys Thr J . .•
275 280 285

Leu Ala Trp Ser Ala Asp Gly Gln Thr Leu Phe Ala Gly Tyr Thr Asp
290 295 300

Asn Leu Val Arg Val Trp Gln Val Thr Ile Gly Thr Arg
305 310 315

(2) INFORMATION FOR SEQ ID NO:42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 340 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta 2 (Human), Fig. 25

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:

Met Ser Glu Leu Glu Gln Leu Arg Gln Glu Ala Glu Gln Leu Arg Asn
10 15

Gln Ile Arg Asp Ala Arg Lys Ala Cys Gly Asp Ser Thr Leu Thr Gln
20 25 30

Ile Thr Ala Gly Leu Asp Pro Val Gly Arg Ile Gln Met Arg Thr Arg
35 40 45

Arg Thr Leu Arg Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Gly
50 55 60

Thr Asp Ser Arg Leu Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile
65 70 75 80

Ile Trp Asp Ser Tyr Thr Thr Asn Lys Val His Ala Ile Pro Leu Arg
85 90 95

Ser Ser Trp Val Met Thr Cys Ala Tyr Ala Pro Ser Gly Asn . -1
100 105 110

Ala Cys Gly Gly Leu Asp Asn Ile Cys Ser Ile Tyr Ser Leu Lys Thr
120 125

Arg Glu Gly Asn Val Arg Val Ser Arg Glu Leu Pro Gly His Thr Gly
130 135 140

Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gln Ile Ile Thr Ser
145 150 155 160

Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp Ile Glu Thr Gly Gln Gln
165 170 175

Thr Val Gly Phe Ala Gly His Ser Gly Asp Val Met Ser Leu Ser Leu
180 185 190

Ala Pro Asp Gly Arg Thr Phe Val Ser Gly Ala Cys Asp Ala Ser Ile
195 200 205

Lys Leu Trp Asp Val Arg Asp Ser Met Cys Arg Gln Thr Phe Ile Gly
210 215 220

His Glu Ser Asp Ile Asn Ala Val Ala Phe Phe Pro Asn Gly Tyr Ala
225 230 235 240

Phe Thr Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp Leu Arg
245 250 255

Ala Asp Gln Glu Leu Leu Met Tyr Ser His Asp Asn Ile Ile Cys Gly
260 265 270

Ile Thr Ser Val Ala Phe Ser Arg Ser Gly Arg Leu Leu Leu Ala Gly
275 280 285

Tyr Asp Asp Phe Asn Cys Asn Ile Trp Asp Ala Met Lys Gly Asp Arg
290 295 300

Ala Gly Val Leu Ala Gly His Asp Asn Arg Val Ser Cys Leu Gly Val
305 310 315 320

Thr Asp Asp Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu
325 330 335

Lys Ile Trp Asn
340

(2) INFORMATION FOR SEQ ID NO: 43: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: G-Beta 4 (mouse), Fig. 26

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

Lys Lys Asx Glu Thr Asx Val Asn Met Gly Arg Tyr Thr Pro Arg Ile
1 5 10 15

Lys His Ile Lys Arg Pro Arg Arg Thr Asp Xaa Xaa Gly
20 25

(2) INFORMATION FOR SEQ ID NO: 44: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 718 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GROUCHO PROTEIN DROSOPH, Fig. 27

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:

Met Tyr Pro Ser Pro Val Arg His Pro Ala Ala Gly Gly Pro Pr-. . 
1 5 10 15 

Gln Gly Pro Ile Lys Phe Thr Ile Ala Asp Thr Leu Glu Arg Ile Lys
20 25 30

Glu Glu Phe Asn Phe Leu Gln Ala His Tyr His Ser Ile Lys Leu Glu
35 40 45

Cys Glu Lys Leu Ser Asn Glu Lys Thr Glu Met Gln Arg His Tyr Val
50 55 60

Met Tyr Tyr Glu Met Ser Tyr Gly Leu Asn Val Glu Met His Lys Gln
65 70 75 80

Thr Glu Ile Ala Lys Arg Leu Asn Thr Leu Ile Asn Gln Leu Leu Pro
85 90 95

Phe Leu Gln Ala Asp His Gln Gln Gln Val Leu Gln Ala Val Glu Arg
100 105 110

Ala Lys Gln Val Thr Met Gln Glu Leu Asn Leu Ile Ile Gly Gln Gln
115 120 125

Ile His Ala Gln Gln Val Pro Gly Gly Pro Pro Gln Pro Met Gly Ala
130 135 140

Leu Asn Pro Phe Gly Ala Leu Gly Ala Thr Met Gly Leu Pro His Gly
145 150 155 160

Pro Gln Gly Leu Leu Asn Lys Pro Pro Glu His His Arg Pro Asp Ile
165 170 175

Lys Pro Thr Gly Leu Glu Gly Pro Ala Ala Ala Glu Glu Arg Leu Arg
180 185 190



Asn Ser Val Ser Pro Ala Asp Arg Glu Lys Tyr Arg Thr Arg Ser Pro
195 200 205

Leu Asp Ile Glu Asn Asp Ser Lys Arg Arg Lys Asp Glu Lys Leu Gln
210 215 220

Glu Asp Glu Gly Glu Lys Ser Asp Gln Asp Leu Val Val Asp Val Ala
230 235 240

Asn Glu Met Glu Ser His rro " v ' :r. Z\/ Glu Ili^ Ser 

245 250 255

Met Glu Val Arg Asp Arg Glu Ser Leu Asn Gly Glu Arg Leu Glu Lys 
260 265 270 

Pro Ser Ser Ser Gly Ile Lys Gln Glu Arg Pro Pro Ser Arg Ser Gly
275 280 285

Ser Ser Ser Ser Arg Ser Thr Pro Ser Leu Lys Thr Lys Asp Met Glu
290 295 300

Lys Pro Gly Thr Pro Gly Ala Lys Ala Arg Thr Pro Thr Pro Asn Ala 
305 310 315 320 

Ala Ala Pro Ala Pro Gly Val Asn Pro Lys Gln Met Met Pro Gln Gly
325 330 335

Pro Pro Pro Ala Gly Tyr Pro Gly Ala Pro Tyr Gln Arg Pro Ala Asp
340 345 350

Pro Tyr Gln Arg Pro Pro Ser Asp Pro Ala Tyr Gly Arg Pro Pro Pro
355 360 365

Met Pro Tyr Asp Pro His Ala His Val Arg Thr Asn Gly Ile Pro His
370 375 380

Pro Ser Ala Leu Thr Gly Gly Lys Pro Ala Tyr Ser Phe His Met Asn
385 390 395 400

Gly Glu Gly Ser Leu Gln Pro Val Pro Phe Pro Pro Asp Ala Leu Val
405 410 415

Gly Val Gly Ile Pro Arg His Ala Arg Gln Ile Asn Thr Leu Ser His
420 425 430

Gly Glu Val Val Cys Ala Val Thr Ile Ser Asn Pro Thr Lys Tyr Val
435 440 445

Tyr Thr Gly Gly Lys Gly Cys Val Lys Val Trp Asp Ile Ser Gln Pro
450 455 460

Gly Asn Lys Asn Pro Val Ser Gln Leu Asp Cys Leu Gln Arg Asp Asn
465 470 475 480

Tyr Ile Arg Ser Val Lys Leu Leu Pro Asp Gly Arg Thr Leu Ile Val
485 490 495



Gly Gly Glu Ala Ser Asn Leu Ser Ile Trp Asp Leu Ala Ser Pro
500 505 510

Pro Arg Ile Lys Ala Glu Leu Thr Ser Ala Ala Pro Ala Cys Tyr Ala
515 520 525

Leu Ala Ser Pro Asp Ser Lys Val Cys Phe Ser Cys Cys Ser Asp Gly
530 535 540

Asn Ile Ala Val Trp Asp Leu His Asn Glu Ile Leu Val Arg Gln Phe
545 550 555 560

Gln Gly His Thr Asp Gly Ala Ser Cys Ile Asp Ile Ser Pro Asp Gly
565 570 575

Ser Arg Leu Trp Thr Gly Gly Leu Asp Asn Thr Val Arg Ser Trp Asp
580 585 590

Leu Arg Glu Gly Arg Gln Leu Gln Gln His Asp Phe Ser Ser Gln Ile
595 600 605

Phe Ser Leu Gly Tyr Cys Pro Thr Gly Asp Trp Leu Ala Val Gly Met
610 615 620

Glu Asn Ser His Val Glu Val Leu His Ala Ser Lys Pro Asp Lys Tyr
625 630 635 640

Gln Leu His Leu His Glu Ser Cys Val Leu Ser Leu Arg Phe Ala Ala
645 650 655

Cys Gly Lys Trp Phe Val Ser Thr Gly Lys Asp Asn Leu Leu Asn Ala
660 665 670

Trp Arg Thr Pro Tyr Gly Ala Ser Ile Phe Gln Ser Lys Glu Thr Ser
675 680 685

Ser Val Leu Ser Cys Asp Ile Ser Thr Asp Asp Lys Tyr Ile Val Thr
690 695 700

Gly Ser Gly Asp Lys Lys Ala Thr Val Tyr Glu Val Ile Tyr
705 710 715

(2) INFORMATION FOR SEQ ID NO:45:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 341 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GTP binding protein (squid), Fig. 28

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:

Met Thr Ser Glu Leu Glu Ala Leu Arg Gln Glu Thr Glu Gln Leu Lys
1 5 10 15

Asn Gln Ile Arg Glu Ala Arg Lys Ala Ala Ala Asp Thr Thr Leu Ala
20 25 30

Met Ala Thr Ala Asn Val Glu Pro Val Gly Arg Ile Gln Met Arg Thr
35 40 45

Arg Arg Thr Leu Arg Gly His Leu Ala Lys Ile Tyr Ala Met His Trp
50 55 60

Ala Ser Asp Ser Arg Asn Leu Val Ser Ala Ser Gln Asp Gly Lys Leu
65 70 75 80

Ile Val Trp Asp Gly Tyr Thr Thr Asn Lys Val His Ala Ile Pro Leu
85 90 95

Arg Ser Ser Trp Val Met Thr Cys Ala Tyr Ala Pro Ser Gly Asn Tyr
100 105

Val Ala Cys Gly Gly Leu Asp Asn Ile Cys Ser Ile Tyr Ser Leu Lys
115 120 125

Thr Arg Glu Gly Asn Val Arg Val Ser Arg Glu Leu Pro Gly His Thr
130 135 140

Gly Tyr Leu Ser Cys Cys Arg Phe Ile Asp Asp Asn Gln Ile Val Thr
145 150 155 160

Ser Ser Gly As? Met Thr Cys Ala Leu Trp Asn Ile Glu Thr Gly Asn
165 170

Gln Ile Thr Ser Phe Gly Gly His Thr Gly Asp Val Met Ser Leu Ser
180 185 190

Leu Ala Pro Asp Met Arg Thr Phe Val Ser Gly Ala Cys Asp Ala Ser
200 205

Ala Lys Leu Phe Asp Ile Arg Asp Gly Ile Cys Lys Gln Thr Phe Thr
210 215 220

Gly His Glu Ser Asp Ile Asn Ala Ile Thr Tyr Phe Pro Asn Gly Phe
225 230 235 240

Ala Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp Ile
245 250 255

Arg Ala Asp Gln Glu Ile Gly Met Tyr Ser His Asp Asn Ile Ile Cys
260 265 270

Gly Ile Thr Ser Val Ala Phe Ser Lys Ser Gly Arg Leu Leu Leu Gly
275 280 285

Gly Tyr Asp Asp Phe Asn Cys Asn Val Trp Asp Val Leu Lys Gln Glu
290 295 300

Arg Ala Gly Val Leu Ala Gly His Asp Asn Arg Val Ser Cys Leu Gly
305 310 315 320

Val Thr Glu Asp Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe
325 330 335

Leu Lys Ile Trp Asn
340

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 410 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: IEF SSP 9306, Fig. 29

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

Met Ala Asp Lys Glu Ala Ala Phe Asp Asp Ala Val Glu Glu Arg Val
10 15



Ile Asn Glu Glu Tyr Lys Ile Trp Lys Lys Asn Thr Pro Phe Leu Tyr
20 25 30

Asp Leu Val Met Thr His Ala Leu Glu Trp Pro Ser Leu Thr Ala Gln
35 40 45

Trp Leu Pro Asp Val Thr Arg Pro Glu Gly Lys Asp Phe Ser Ile His
50 55 60

Arg Leu Val Leu Gly Thr His Thr Ser Asp Glu Gln Asn His Leu Val
65 70 75 80

Ile Ala Ser Val Gln Leu Pro Asn Asp Asp Ala Gln Phe Asp Ala Ser
85 90 95

His Tyr Asp Ser Glu Lys Gly Glu Phe Gly Gly Phe Gly Ser Val Ser
100 105 110

Gly Lys Ile Glu Ile Glu Ile Lys Ile Asn His Glu Gly Glu Val Asn
115 120 125

Arg Ala Arg Tyr Met Pro Gln Asn Pro Cys Ile Ile Ala Thr Lys Thr
130 135 140

Pro Ser Ser Asp Val Leu Val Phe Asp Tyr Thr Lys His Pro Ser Lys
150 155 160

Pro Asp Pro Ser Gly Glu Cys Asn Pro Asp Leu Arg Leu Arg Gly His
170 175

Gln Lys Glu Gly Tyr Gly Leu Ser Trp Asn Pro Asn Leu Ser Gly His
180 185 190



Leu Leu Ser Ma Ser As? Asp His Thr Ile Cys Leu Trp Aso Ile Ser
195 200 205

Ala Val Pro Lys Glu Gly Lys Val Val Asp Ala Lys Thr Ile Phe - ■
210 215 220

Gly His Thr Ala Val Val Glu Asp Val Ser Trp His Leu Leu His Glu
225 230 235 240

Ser Leu Phe Gly Ser Val Ala Asp Asp Gln Lys Leu Met Ile Trp Asp
245 250 255

Thr Arg Ser Asn Asn Thr Ser Lys Pro Ser His Ser Val Asp Ala His
260 265 270

Thr Ala Glu Val Asn Cys Leu Ser Phe Asn Pro Tyr Ser Glu Phe Ile
275 280 285

Leu Ala Thr Gly Ser Ala Asp Lys Thr Val Ala Leu Trp Asp Leu Arg
290 295 300

Asn Leu Lys Leu Lys Leu His Ser Phe Glu Ser His Lys Asp Glu Ile
310 315 320

Phe Gln Val Gln Trp Ser Pro His Asn Glu Thr Ile Leu Ala Ser Ser
325 330 335

Gly Thr Asp Arg Arg Leu Asn Val Trp Asp Leu Ser Lys Ile Gly Glu
340 345

Glu Gln Ser Pro Glu Asp Ala Glu Asp Gly Pro Pro Glu Leu Leu Phe
355 360 365

Ile His Gly Gly His Thr Ala Lys Ile Ser Asp Phe Ser Trp Asn Pro
370 375 380

Asn Glu Pro Trp Val Ile Cys Ser Val Ser Glu Asp Asn Ile Met Gln
390 395 400

Val Trp Gln Met Glu Leu Val Leu Asp His
405 410

(2) INFORMATION FOR SEQ ID NO:47:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 317 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: HUMAN 12.3, Fig. 30
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:

Met Thr Glu Gln Met Thr Leu Arg Gly Thr Leu Lys Gly His Asn Gly
1 5 10 15

Trp Val Thr Gln Ile Ala Thr Thr Pro Gln Phe Pro Asp Met Ile Leu
20 25 30

Ser Ala Ser Arg Asp Lys Thr Ile Ile Met Trp Lys Leu Thr Arg Asp
35 40 45

Glu Thr Asn Tyr Gly Ile Pro Gln Arg Ala Leu Arg Gly His Ser His
50 55 60

Phe Val Ser Asp Val Val Ile Ser Ser Asp Gly Gln Phe Ala Leu Ser
65 70 75 80

Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp Leu Thr Thr Gly Thr
85 90 95

Thr Thr Arg Arg Phe Val Gly His Thr Lys Asp Val Leu Ser Val Ala
100 105 110

Phe Ser Ser Asp Asn Arg Gln Ile Val Ser Gly Ser Arg Asp Lys Thr
115 120 125

Ile Lys Leu Trp Asn Thr Leu Gly Val Cys Lys Tyr Thr Val Gln Asp
130 135 140

Glu Ser His Ser Glu Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser
150 155 160

Ser Asn Pro Ile Ile Val Ser Cys Gly Trp Asp Lys Leu Val Lys Val
165 170 175

^-P -^-S" Leu Ala Asn Cy3 Lys I. i-i .V/t 7 - r .--rrt Hi 7. r'. » Oly Hio Thr 
180 185 190 

Gly Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser Leu Cys Ala
195 200 205

Ser Gly Gly Lys Asp Gly Gln Ala Met Leu Trp Asp Leu Asn Glu Gly
210 215 220

Lys His Leu Tyr Thr Leu Asp Gly Gly Asp Ile Ile Asn Ala Leu Cys
225 230 235 240

Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala Thr Gly Pro Ser Ile
245 250 255

Lys Ile Trp Asp Leu Glu Gly Lys Ile Ile Val Asp Glu Leu Lys Gln
260 265 270

Glu Val Ile Ser Thr Ser Ser Lys Ala Glu Pro Pro Gln Cys Thr Ser
275 280 285

Leu Ala Trp Ser Ala Asp Gly Gln Thr Leu Phe Ala Gly Tyr Thr Asp
290 295 300

Asn Leu Val Arg Val Trp Gln Val Thr Ile Gly Thr Arg
305 310 315

(2) INFORMATION FOR SEQ ID NO:48:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 425 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: IEF-7442 - human, Fig. 31

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:

Met Ala Ser Lys Glu Met Phe Glu Asp Thr Val Glu Glu Arg Val Ile
1 5 10 15

Asn Glu Glu Tyr Lys Ile Trp Lys Lys Asn Thr Pro Phe Leu Tyr Asp
20 25 30 

Leu Val Met Thr His Ala Leu Gln Trp Pro Ser Leu Thr Val Gln Trp
35 40 45

Leu Pro Glu Val Thr Lys Pro Glu Gly Lys Asp Tyr Ala Leu His Trp
50 55 60

Leu Val Leu Gly Thr His Thr Ser Asp Glu Gln Asn His Leu Val Val
65 70 75 80

Ala Arg Val His Ile Pro Asn Asp Asp Ala Gln Phe Asp Ala Ser His
85 90 95

Cys Asp Ser Asp Lys Gly Glu Phe Gly Gly Phe Gly Ser Val Thr Gly
100 105 110

Lys Ile Glu Cys Glu Ile Lys Ile Asn His Glu Gly Glu Val Asn Arg
115 120 125

Ala Arg Tyr Met Pro Gln Asn Pro His Ile Ile Ala Thr Lys Thr Pro
130 135 140

Ser Ser Asp Val Leu Val Phe Asp Tyr Thr Lys His Pro Ala Lys Pro
145 150 155 160

Asp Pro Ser Gly Glu Cys Asn Pro Asp Leu Arg Leu Arg Gly His Gln
165 170 175

Lys Glu Gly Tyr Gly Leu Ser Trp Asn Ser Asn Leu Ser Gly His Leu
180 185 190

Leu Ser Ala Ser Asp Asp His Thr Val Cys Leu Trp Asp Ile Asn Ala
195 200 205

Gly Pro Lys Glu Gly Lys Ile Val Asp Ala Lys Ala Ile Phe Thr Gly
210 215 220

His Ser Ala Val Val Glu Asp Val Ala Trp His Leu Leu His Glu Ser
225 230 235 240

Leu Phe Gly Ser Val Ala Asp Asp Gln Lys Leu Met Ile Trp Asp Thr
245 250 255

Arg Ser Asn Thr Thr Ser Lys Pro Ser His Leu Val Asp Ala His Thr

260 265 270 

Ala Glu Val Asn Cys Leu Ser Phe Asn Pro Tyr Ser Glu Phe Ile Leu
275 280 285

Ala Thr Gly Ser Ala Asp Lys Thr Val Ala Leu Trp Asp Leu Arg Asn
290 295 300

Leu Lys Leu Lys Leu His Thr Phe Glu Ser His Lys Asp Glu Ile Phe
305 310 315 320

Gln Val His Trp Ser Pro His Asn Glu Thr Ile Leu Ala Ser Ser Gly
325 330 335

Thr Asp Arg Arg Leu Asn Val Trp Asp Leu Ser Lys Ile Gly Glu Glu
340 345 350

Gln Ser Ala Glu Asp Ala Glu Asp Gly Pro Pro Glu Leu Leu Phe Ile
355 360 365

His Gly Gly His Thr Ala Lys Ile Ser Asp Phe Ser Trp Asn Pro Asn
370 375 380

Glu Pro Trp Val Ile Cys Ser Val Ser Glu Asp Asn Ile Met Gln Ile
385 390 395 400

Trp Gln Met Ala Glu Asn Ile Tyr Asn Asp Glu Glu Ser Asp Val Thr
405 410 415

Thr Ser Glu Leu Glu Gly Gln Gly Ser
420 425

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 605 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: Insulin-like growth factor binding protein complex, Fig. 32

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

Met Ala Leu Arg Lys Gly Gly Leu Ala Leu Ala Leu Leu Leu Leu Ser
1 5 10 15

Trp Val Ala Leu Gly Pro Arg Ser Leu Glu Gly Ala Asp Pro Gly Thr
20 25 30

Pro Gly Glu Ala Glu Gly Pro Ala Cys Pro Ala Ala Cys Val Cys Ser
35 40 45

Tyr Asp Asp Asp Ala Asp Glu Leu Ser Val Phe Cys Ser Ser Arg Asn
50 55 60

Leu Thr Arg Leu Pro Asp Gly Val Pro Gly Gly Thr Gln Ala Leu Trp
65 70 75 80

Leu Asp Gly Asn Asn Leu Ser Ser Val Pro Pro Ala Ala Phe Gln Asn
85 90 95

Leu Ser Ser Leu Gly Phe Leu Asn Leu Gln Gly Gly Gln Leu Gly Ser
100 105 110

Leu Glu Pro Gln Ala Leu Leu Gly Leu Glu Asn Leu Cys His Leu His
115 120 125

Leu Glu Arg Asn Gln Leu Arg Ser Leu Ala Leu Gly Thr Phe Ala His
130 135 140

Thr Pro Ala Leu Ala Ser Leu Gly Leu Ser Asn Asn Arg Leu Ser Arg
145 150 155 160

Leu Glu Asp Gly Leu Phe Glu Gly Leu Gly Ser Leu Trp Asp Leu Asn
165 170 175

Leu Gly Trp Asn Ser Leu Ala Val Leu Pro Asp Ala Ala Phe Arg Gly
180 185 190

Leu Gly Ser Leu Arg Glu Leu Val Leu Ala Gly Asn Arg Leu Ala Tyr
195 200 205

Leu Gln Pro Ala Leu Phe Ser Gly Leu Ala Glu Leu Arg Glu Leu Asp
210 215 220

Leu Ser Arg Asn Ala Leu Arg Ala Ile Lys Ala Asn Val Phe Val Gln
225 230 235 240

Leu Pro Arg Leu Gln Lys Leu Tyr Leu Asp Arg Asn Leu Ile Ala Ala
245 250 255

Val Ala Pro Gly Ala Phe Leu Gly Leu Lys Ala Leu Arg Trp Leu Asp
260 265 270

Leu Ser His Asn Arg Val Ala Gly Leu Leu Glu Asp Thr Phe Pro Gly
275 280 285

Leu Leu Gly Leu Arg Val Leu Arg Leu Ser His Asn Ala Ile Ala Ser
290 295 300

Leu Arg Pro Arg Thr Phe Lys Asp Leu His Phe Leu Glu Glu Leu Gln
305 310 315 320

Leu Gly His Asn Arg Ile Arg Gln Leu Ala Glu Arg Ser Phe Glu Gly
325 330 335

Leu Gly Gln Leu Glu Val Leu Thr Leu Asp His Asn Gln Leu Gln Glu
340 345 350

Val Lys Ala Gly Ala Phe Leu Gly Leu Thr Asn Val Ala Val Met Asn
355 360 365

Leu Ser Gly Asn Cys Leu Arg Asn Leu Pro Glu Gln Val Phe Arg Gly
370 375 380

Leu Gly Lys Leu His Ser Leu His Leu Glu Gly Ser Cys Leu Gly Arg
385 390 395 400

Ile Arg Pro His Thr Phe Thr Gly Leu Ser Gly Leu Arg Arg Leu Phe
405 410 415

Leu Lys Asp Asn Gly Leu Val Gly Ile Glu Glu Gln Ser Leu Trp Gly
420 425 430

Leu Ala Glu Leu Leu Glu Leu Asp Leu Thr Ser Asn Gln Leu Thr His
435 440 445

Leu Pro His Arg Leu Phe Gln Gly Leu Gly Lys Leu Glu Tyr Leu Leu
450 455 460



Leu Ser Arg Asn Arg Leu Ala Glu Leu Pro Ala Asp Ala Leu Gly
465 470 475 480

Leu Gln Arg Ala Phe Trp Leu Asp Val Ser His Asn Arg Leu Glu Ala
485 490 495

Leu Pro Asn Ser Leu Leu Ala Pro Leu Gly Arg Leu Arg Tyr Leu Ser
500 505 510

Leu Arg Asn Asn Ser Leu Arg Thr Phe Thr Pro Gln Pro Pro Gly Leu
515 520 525

Glu Arg Leu Trp Leu Glu Gly Asn Pro Trp Asp Cys Gly Cys Pro Leu
530 535 540

Lys Ala Leu Arg Asp Phe Ala Leu Gln Asn Pro Ser Ala Val Pro Arg
545 550 555 560

Phe Val Gln Ala Ile Cys Glu Gly Asp Asp Cys Gln Pro Pro Ala Tyr
565 570 575

Thr Tyr Asn Asn Ile Thr Cys Ala Ser Pro Pro Glu Val Val Gly Leu
580 585 590

Asp Leu Arg Asp Leu Ser Glu Ala His Phe Ala Pro Cys
595 600 605

(2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 603 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: Insulin-like growth factor bind. pro. complex-rat, Fig. 33

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

Met Ala Leu Arg Thr Gly Gly Pro Ala Leu Val Val Leu Leu Ala
1 5 10 15

Trp Val Ala Leu Gly Pro Cys His Leu Gln Gly Thr Asp Pro Gly Ala
20 25 30

Ser Ala Asp Ala Glu Gly Pro Gln Cys Pro Val Ala Cys Thr Cys Ser
35 40 45

His Asp Asp Tyr Thr Asp Glu Leu Ser Val Phe Cys Ser Ser Lys Asn
50 55 60

Leu Thr His Leu Pro Asp Asp Ile Pro Val Ser Thr Arg Ala Leu Trp
65 70 75 80

Leu Asp Gly Asn Asn Leu Ser Ser Ile Pro Ser Ala Ala Phe Gln Asn
85 90 95

Leu Ser Ser Leu Asp Phe Leu Asn Leu Gln Gly Ser Trp Leu Arg Ser
100 105 110

Leu Glu Pro Gln Ala Leu Leu Gly Leu Gln Asn Leu Tyr Tyr Leu His
115 120 125

Leu Glu Arg Asn Arg Leu Arg Asn Leu Ala Val Gly Leu Phe Thr His
130 135 140

Thr Pro Ser Leu Ala Ser Leu Ser Leu Ser Ser Asn Leu Leu Gly Arg
145 150 155 160

Leu Glu Glu Gly Leu Phe Gln Gly Leu Ser His Leu Trp Asp Leu Asn
165 170 175

Leu Gly Trp Asn Ser Leu Val Val Leu Pro Asp Thr Val Phe Gln Gly
180 185 190

Leu Gly Asn Leu His Glu Leu Val Leu Ala Gly Asn Lys Leu Thr Tyr
195 200 205

Leu Gln Pro Ala Leu Phe Cys Gly Leu Gly Glu Leu Arg Glu Leu Asp
210 215 220

Leu Ser Arg Asn Ala Leu Arg Ser Val Lys Ala Asn Val Phe Val His
225 230 235 240



Leu Pro Arg Leu Gln Lys Leu Tyr Leu Asp Arg Leu Ile Thr Ala
245 250 255



Val Ala Pro Gly Ala Phe Leu Gly Met Lys Ala Leu Arg Trp Leu Asp
260 265 270

Leu Ser His Asn Arg Val Ala Gly Leu Met Glu Asp Thr Phe Pro Gly
275 280 285

Leu Leu Gly Leu His Val Leu Arg Leu Ala His Asn Ala Ile Ala Ser
290 295 300

Leu Arg Pro Arg Thr Phe Lys Asp Leu His Phe Leu Glu Glu Leu Gln
305 310 315 320

Leu Gly His Asn Arg Ile Arg Gln Leu Gly Glu Arg Thr Phe Glu Gly
325 330 335

Leu Gly Gln Leu Glu Val Leu Thr Leu Asn Asp Asn Gln Ile Thr Glu
340 345 350

Val Arg Val Gly Ala Phe Ser Gly Leu Phe Asn Val Ala Val Met Asn
355 360 365

Leu Ser Gly Asn Cys Leu Arg Ser Leu Pro Glu Arg Val Phe Gln Gly
370 375 380

Leu Asp Lys Leu His Ser Leu His Leu Glu His Ser Cys Leu Gly His
385 390 395 400

Val Arg Leu His Thr Phe Ala Gly Leu Ser Gly Leu Arg Arg Leu Phe
405 410 415

Leu Arg Asp Asn Ser Ile Ser Ser Ile Glu Glu Gln Ser Leu Ala Gly
420 425 430

Leu Ser Glu Leu Leu Glu Leu Asp Leu Thr Thr Asn Arg Leu Thr His
435 440 445

Leu Pro Arg Gln Leu Phe Gln Gly Leu Gly His Leu Glu Tyr Leu Leu
450 455 460

Leu Ser Tyr Asn Gln Leu Thr Thr Leu Ser Ala Glu Val Leu Gly Pro
465 470 475 480

Leu Gln Arg Ala Phe Trp Leu Asp Ile Ser His Asn His Leu Glu Thr
485 490 495



Leu Ala Glu Gly Leu Phe Ser Ser Leu Gly Arg Val Arg Tyr Leu
500 505 510

Leu Arg Asn Asn Ser Leu Gln Thr Phe Ser Pro Gln Pro Gly Leu Glu
515 520 525

Arg Leu Trp Leu Asp Ala Asn Pro Trp Asp Cys Ser Cys Pro Leu Lys
530 535 540

Ala Leu Arg Asp Phe Ala Leu Gln Asn Pro Gly Val Val Pro Arg Phe
545 550 555 560

Val Gln Thr Val Cys Glu Gly Asp Asp Cys Gln Pro Val Tyr Thr Tyr
565 570 575

Asn Asn Ile Thr Cys Ala Gly Pro Ala Asn Val Ser Gly Leu Asp Leu
580 585 590

Arg Asp Val Ser Glu Thr His Phe Val His Cys
595 600

(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 409 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: LIS1 (human), Fig. 34

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

Met Val Leu Ser Gln Arg Gln Arg Asp Glu Leu Asn Arg Ala Ile Ala
1 5 10 15

Asp Tyr Leu Arg Ser Asn Gly Tyr Glu Glu Ala Tyr Ser Val Phe Lys
20 25 30

Lys Glu Ala Glu Leu Asp Val Asn Glu Glu Leu Asp Lys Lys Tyr Ala

35 40 45 

Gly Leu Leu Glu Lys Lys Trp Thr Ser Val Ile Arg Leu Gln Lys Lys
50 55 60

Val Met Glu Leu Glu Ser Lys Leu Asn Glu Ala Lys Glu Glu Phe Thr
65 70 75 80

Ser Gly Gly Pro Leu Gly Gln Lys Arg Asp Pro Lys Glu Trp Ile Pro
85 90 95

Arg Pro Pro Glu Lys Tyr Ala Leu Ser Gly His Arg Ser Pro Val Thr
100 105 110

Arg Val Ile Phe His Pro Val Phe Ser Val Met Val Ser Ala Ser Glu
115 120 125

Asp Ala Thr Ile Lys Val Trp Asp Tyr Glu Thr Gly Asp Phe Glu Arg
130 135 140

Thr Leu Lys Gly His Thr Asp Ser Val Gln Asp Ile Ser Phe Asp His
145 150 155 160

Ser Gly Lys Leu Leu Ala Ser Cys Ser Ala Asp Met Thr Ile Lys Leu
165 170 175

Trp Asp Phe Gln Gly Phe Glu Cys Ile Arg Thr Met His Gly His Asp
180 185 190

His Asn Val Ser Ser Val Ala Ile Met Pro Asn Gly Asp His Ile Val
195 200 205

Ser Ala Ser Arg Asp Lys Thr Ile Lys Met Trp Glu Val Gln Thr Gly
210 215 220

Tyr Cys Val Lys Thr Phe Thr Gly His Arg Glu Trp Val Arg Met Val
225 230 235 240

Arg Pro Asn Gln Asp Gly Thr Leu Ile Ala Ser Cys Ser Asn Asp Gln
245 250 255

Thr Val Arg Val Trp Val Val Ala Thr Lys Glu Cys Lys Ala Glu Leu
260 265 270



Arg Glu His Glu His Val Val Glu Cys Ile Ser Trp Ala Pro Glu Ser
275 280 285



Ser Tyr Ser Ser Ile Ser Glu Ala Thr Gly Ser Glu Thr Lys Lys Ser
290 295 300

Gly Lys Pro Gly Pro Phe Leu Leu Ser Gly Ser Arg Asp Lys Thr Lys
305 310 315 320

Met Trp Asp Val Ser Thr Gly Met Cys Leu Met Thr Leu Val Gly His
325 330 335

Asp Asn Trp Val Arg Gly Val Leu Phe His Ser Gly Gly Lys Phe Ile
340 345 350

Leu Ser Cys Ala Asp Asp Lys Thr Leu Arg Val Trp Asp Tyr Lys Asn
355 360 365

Lys Arg Cys Met Lys Thr Leu Asn Ala His Glu His Phe Val Thr Ser
370 375 380

Leu Asp Phe His Lys Thr Ala Pro Tyr Val Val Thr Gly Ser Val Asp
385 390 395 400

Gln Thr Val Lys Val Trp Glu Cys Arg
405

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 422 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: MD6, Fig. 35

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

Met Glu Arg Lys Asp Phe Glu Thr Trp Leu Asp Asn Ile Ser Val Thr
1 5 10 15

Phe Leu Ser Leu Met Asp Leu Gln Lys Asn Glu Thr Leu Asp His Leu
20 25 30

Ile Ser Leu Ser Gly Ala Val Gln Leu Arg His Leu Ser Asn Asn Leu
35 40 45

Glu Thr Leu Leu Lys Arg Phe Leu Lys Leu Leu Pro Leu Glu Leu
50 55 60

Ser Phe Tyr Leu Leu Lys Trp Leu Asp Pro Gln Thr Leu Leu Thr Cys
65 70 75 80

Cys Leu Val Ser Lys Gln Arg Asn Lys Val Ile Ser Ala Cys Thr Glu
85 90 95

Val Trp Gln Thr Ala Cys Lys Asn Leu Gly Trp Gln Ile Asp Asp Ser
100 105 110



Val Gln Asp Ser Leu His Trp Lys Lys Val T\t: Leu Lys Ala Ile Leu
115 120 125

Arg Met Lys Gln Leu Glu Asp His Glu Ala Phe Glu Thr Ser Ser Leu
130 135 140

Ile Gly His Ser Ala Arg Val Tyr Ala Leu Tyr Tyr Lys Asp Gly Leu
145 150 155 160

Leu Cys Thr Gly Ser Asp Asp Leu Ser Ala Lys Leu Trp Asp Val Ser
165 170 175

Thr Gly Gln Cys Val Tyr Gly Ile Gln Thr His Thr Cys Ala Ala Val
180 185 190

Lys Phe Asp Glu Gln Lys Leu Val Thr Gly Ser Phe Asp Asn Thr Val
195 200 205

Ala Cys Trp Glu Trp Ser Ser Gly Ala Arg Thr Gln His Phe Arg Gly
210 215 220

His Thr Gly Ala Val Phe Ser Val Asp Tyr Ser Asp Glu Leu Asp Ile
225 230 235 240

Leu Val Ser Gly Ser Ala Asp Phe Ala Val Lys Val Trp Ala Leu Ser
245 250 255

Ala Gly Thr Cys Leu Asn Thr Leu Thr Gly His Thr Glu Trp Val
260 265 270

Lys Val Val Leu Gln Lys Cys Lys Val Lys Ser Leu Leu His Ser Pro
275 280 285

Gly Asp Tyr Ile Leu Leu Ser Ala Asp Lys Tyr Glu Ile Lys Ile Trp
290 295 300

Pro Ile Gly Arg Glu Ile Asn Cys Lys Cys Leu Lys Thr Leu Ser Val
305 310 315 320

Ser Glu Asp Arg Ser Ile Cys Leu Gln Pro Arg Leu His Phe Asp Gly
325 330 335

Lys Tyr Ile Val Cys Ser Ser Ala Leu Gly Leu Tyr Gln Trp Asp Phe
340 345 350

Ala Ser Tyr Asp Ile Leu Arg Val Ile Lys Thr Pro Glu Val Ala Asn
355 360 365

Leu Ala Leu Leu Gly Phe Gly Asp Val Phe Ala Leu Leu Phe Asp Asn
370 375 380

His Tyr Leu Tyr Ile Met Asp Leu Arg Thr Glu Ser Leu Ile Ser Arg
385 390 395 400



Trp Pro Leu Pro Glu Tyr Arg Lys Ser Lys Arg Gly Thr Ser Phe Leu
405 410 415

Ala Gly Glu Arg Pro Gly
420

(2) INFORMATION FOR SEQ ID NO: 53:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 422 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: MSLl, Fig. 36

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:

Met Asn Gln Cys Ala Lys Asp Ile Thr His Glu Ala Ser Ser Ile Pro
1 5 10 15

Ile Asp Leu Gln Glu Arg Tyr Ser His Trp Lys Lys Asn Thr Lys Leu
20 25 30

Leu Tyr Asp Tyr Leu Asn Thr Asn Ser Thr Lys Trp Pro Ser Leu Thr
35 40 45

Cys Gln Phe Phe Pro Asp Leu Asp Thr Thr Ser Asp Glu His Arg Ile
50 55 60

Leu Leu Ser Ser Phe Thr Ser Ser Gln Lys Pro Glu Asp Glu Thr Ile
65 70 75 80

Tyr Ile Ser Lys Ile Ser Thr Leu Gly His Ile Lys Trp Ser Ser Leu
85 90 95

Asn Asn Phe Asp Met Asp Glu Met Glu Phe Lys Pro Glu Asn Ser Thr
100 105 110

Arg Phe Pro Ser Lys His Leu Val Asn Asp Ile Ser Ile Phe Phe Pro
115 120 125

Asn Gly Glu Cys Asn Arg Ala Arg Tyr Leu Pro Gln Asn Pro Asp Ile
130 135 140

Ile Ala Gly Ala Ser Ser Asp Gly Ala Ile Tyr Ile Phe Asp Arg Thr
145 150 155 160

Lys His Gly Ser Thr Arg Ile Arg Gln Ser Lys Ile Ser His Pro Phe
165 170 175

Glu Thr Lys Leu Phe Gly Ser His Gly Val Ile Gln Asp Val Glu Ala
180 185 190

Met Asp Thr Ser Ser Ala Asp Ile Asn Glu Ala Thr Ser Leu Ala Trp
195 200 205

Asn Leu Gln Gln Glu Ala Leu Leu Leu Ser Ser His Ser Asn Gly Gln
210 215 220

Val Gln Val Trp Asp Ile Lys Gln Tyr Ser His Glu Asn Pro Ile Ile
225 230 235 240

Asp Leu Pro Leu Val Ser Ile Asn Ser Asp Gly Thr Ala Val Asn Asp
245 250 255

Val Thr Trp Met Pro Thr His Asp Ser Leu Phe Ala Ala Cys Thr Glu
260 265 270

Gly Asn Ala Val Ser Leu Leu Asp Leu Arg Thr Lys Lys Glu Lys Leu
275 280 285

Gln Ser Asn Arg Glu Lys His Asp Gly Gly Val Asn Ser Cys Arg Phe
290 295 300

Asn Tyr Lys Asn Ser Leu Ile Leu Ala Ser Ala Asp Ser Asn Gly Arg
305 310 315 320

Leu Asn Leu Trp Asp Ile Arg Asn Met Asn Lys Ser Pro Ile Ala Thr
325 330 335

Met Glu His Gly Thr Ser Val Ser Thr Leu Glu Trp Ser Pro Asn Phe
340 345 350

Asp Thr Val Leu Ala Thr Ala Gly Gln Glu Asp Gly Leu Val Lys Leu
355 360 365

Trp Asp Thr Ser Cys Glu Glu Thr Ile Phe Thr His Gly Gly His Met
370 375 380

Leu Gly Val Asn Asp Ile Ser Trp Asp Ala His Asp Pro Trp Leu Met
385 390 395 400

Cys Ser Val Ala Asn Asp Asn Ser Val His Ile Trp Lys Pro Ala Gly
405 410 415

Asn Leu Val Gly His Ser
420



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 816 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO



WO 95/21252

PCT/US95/01210




(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: MUS MUSCULUS PROTEIN, Fig. 37

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54:

Phe Arg Met Asp Asn Ala Ser Thr Arg Ile Asp Glu Arg Phe Arg Ile
1 5 10 15

Asp Ala Tyr Ala Asn Ala Arg Tyr Pro Met Pro Arg Thr Glu Ile Asn
20 25 30

Ser Glu Gln Glu Asn Cys Glu Asn Thr Ile Thr Leu Glu Asp Ser Glu
35 40 45

Gln Glu Asn Cys Glu Ala Ala Cys Met Pro Leu Glu Thr Glu Ser Glu
50 55 60

Gln Glu Asn Cys Glu Met Ser Ser His Glu Ser Tyr Thr Asn Ala Ala
65 70 75 80

Glu Thr Pro Glu Asn Ile Ser Ile Leu Ser Cys Leu Gly Glu Thr Ser
85 90 95

Gly Ala Leu Val Asp Thr Lys Thr Ile Ser Asp Ile Lys Thr Met Asp
100 105 110

Pro Arg Val Ser Leu Thr Pro Ser Ser Asp Val Thr Gly Thr Glu Asp
115 120 125

Ser Ser Val Leu Thr Pro Gln Ser Thr Asp Val Asn Ser Val Asp Ser
130 135 140

Tyr Gln Gly Tyr Glu Gly Asp Asp Asp Asp Glu Glu Asp Asp Glu Asp
145 150 155 160

Asp Lys Aro Gl/ Asp Ser Asn Leu Pro Ser Leu Glu Asp Ser Asp Asn
165 170 175

Phe Ile Ser Cys Leu Glu Asn Ser Tyr Ile Pro Gln Asn Val Glu ."
180 185 190

Gly Glu Val Val Glu Glu Gln Ser Leu Gly Arg Arg Phe His Pro Tyr
195 200 205

Glu Leu Glu Ala Gly Glu Val Val Glu Gly Gln Gly Gly Gly Ser Leu
210 215 220

Phe Tyr Pro Tyr Glu Leu Glu Ala Gly Glu Val Val Glu Ala Gln Asn
225 230 235 240

Val Gln Asn Leu Phe His Arg Tyr Glu Leu Glu Glu Gly Glu Val Val
245 250 255

Glu Ala Gln Val Val Gln Ser Met Phe Pro Tyr Tyr Glu Leu Glu Ala
260 265 270

Gly Glu Val Val Glu Ala Glu Glu Val Gln Gly Phe Phe Gln Arg Tyr
275 280 285

Glu Leu Glu Ala Arg Glu Val Ile Gly Ala Gln Gly Gly Gln Gly Leu
290 295 300

Ser Arg His Tyr Gly Leu Glu Gly Gly Glu Val Val Glu Ala Thr Ala
305 310 315 320

Val Arg Arg Leu Ile Gln His His Glu Leu Glu Glu Gly Glu Asp Val
325 330 335

Asp Asp Gln Glu Glu Ser Ser Glu Met His Glu Glu Thr Ser Glu Asp
340 345 350

Ser Ser Glu Gln Tyr Asp Ile Glu Asp Asp Ser Leu Ile Asp Glu Trp
355 360 365

Ile Ala Leu Glu Thr Ser Pro Leu Pro Arg Pro Arg Trp Asn Val Leu
370 375 380

Ser Ala Leu Arg Asp Arg Gln Leu Gly Ser Ser Gly Arg Phe Val Tyr
385 390 395 400

Glu Ala Cys Gly Ala Arg Leu Phe Val Gln Arg Phe Ser Leu Glu His
405 410 415

Vdl r:- Clu Giy H-s :ny -ys :.. r. Thr Val :£i ; f.- Am Glri 

420 425 430

His Gly Thr Leu Leu Ala Ser Gly Ser Asp Asp Leu Lys Val Ile Val
435 440 445

Trp Asp Trp Leu Lys Lys Arg Ser Val Leu Asn Phe Asp Ser Gly His
450 455 460




Lys Asn Asn Ile Leu Cla Ala Lys Phe Leu Pro Asa Cys Asn Asp Ala
465 470 475 480

Ile Leu Ala Met Cys Gly Arg Asp Gly Gln Val Arg Val Ala Gln Leu
485 490 495

Ser Ala Val Ala Gly Thr His Met Thr Lys Arg Leu Val Lys His Gly
500 505 510

Gly Ala Ser His Arg Leu Gly Leu Glu Pro Asp Ser Pro Phe Arg Phe
515 520 525

Leu Thr Ser Gly Glu Asp Ala Val Val Phe Asn Ile Asp Leu Arg Gln
530 535 540

Ala His Pro Ala Ser Lys Leu Leu Val Ile Lys Asp Gly Asp Lys Lys
545 550 555 560

Val Gly Leu Tyr Thr Val Phe Val Asn Pro Ala Asn Val Tyr Gln Phe
565 570 575

Ala Val Gly Gly Gln Asp Gln Phe Met Arg Ile Tyr Asp Gln Arg Lys
580 585 590

Ile Asp Glu Asn Val Asn Asn Gly Val Leu Lys Lys Phe Cys Pro His
595 600 605

His Leu Leu Ser Ser Asp Tyr Pro Ala His Ile Thr Ser Leu Met Tyr
610 615 620

Ser Tyr Asp Gly Thr Glu Ile Leu Ala Ser Tyr Asn Asp Glu Asp Ile
625 630 635 640

Tyr Ile Phe Asn Ser Ser Asp Ser Asp Gly Ala Gln Tyr Ala Lys Arg
645 650 655

Tyr Lys Gly His Arg Asn Asn Ser Thr Val Lys Gly Val Tyr Phe Tyr
660 665 670

Gly Pro Arg Ser Glu Phe Val Met Ser Gly Ser Asp Cys Gly His Ile
675 680 685

Phe Ile Trp Glu Lys Ser Ser Cys Gln Ile Val Gln Phe Leu Glu Ala
690 695 700

Asp Glu Gly Gly Thr Ile Asn Cys Ile Asp Ser His Pro Tyr Leu Pro
705 710 715 720

Val Leu Ala Ser Ser Gly Leu Asp His Glu Val Lys Ile Trp Ser Pro
725 730 735

Ile Ala Glu Pro Ser Lys Lys Leu Ala Gly Leu Lys Asn Val Ile Lys
740 745 750

Ile Asn Lys Leu Lys Arg Asp Asn Phe Thr Leu Arg His Thr Ser Leu
755 760 765

Phe Asn Asn Ser Met Leu Cys Phe Leu Met Ser His Val Thr Gln Ser
770 775 780

Asn Tyr Gly Arg Ser Trp Arg Gly Ile Arg Ile Asn Ala Gly Gly Gly
785 790 795 800

Asp Phe Ser Asp Ser Ser Ser Ser Ser Glu Glu Thr Asn Gln Glu Ser
805 810 815

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 422 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: ORF RBI, Fig. 38

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:

Met Asn Gln Cys Ala Lys Asp Ile Thr His Glu Ala Ser Ser Ile Pro
1 5 10 15

Ile Asp Leu Gln Glu Arg Tyr Ser His Trp Lys Lys Asn Thr Lys Leu
20 25 30

Leu Tyr Asp Tyr Leu Asn Thr Asn Ser Thr Lys Trp Pro Ser Leu Thr
35 40 45

Cys Gln Phe Phe Pro Asp Leu Asp Thr Thr Ser Asp Glu His Arg Ile
50 55 60

Leu Leu Ser Ser Phe Thr Ser Ser Gln Lys Pro Glu Asp Glu Thr Ile
65 70 75 80

Tyr Ile Ser Lys Ile Ser Thr Leu Gly His Ile Lys Trp Ser Ser Leu
85 90 95

Asn Asn Phe Asp Met Asp Glu Met Glu Phe Lys Pro Glu Asn Ser Thr
100 105 110

Arg Phe Pro Ser Lys His Leu Val Asn Asp Ile Ser Ile Phe Phe Pro
115 120 125

Asn Gly Glu Cys Asn Arg Ala Arg Tyr Leu Pro Gln Asn Pro Asp Ile
130 135 140

Ile Ala Gly Ala Ser Ser Asp Gly Ala Ile Tyr Ile Phe Asp Arg Thr
145 150 155 160

Lys His Gly Ser Thr Arg Ile Arg Gln Ser Lys Ile Ser His Pro Phe
165 170 175

Glu Thr Lys Leu Phe Gly Ser His Gly Val Ile Gln Asp Val Glu Ala
180 185 190

Met Asp Thr Ser Ser Ala Asp Ile Asn Glu Ala Thr Ser Leu Ala Trp
195 200 205

Asn Leu Gln Gln Glu Ala Leu Leu Leu Ser Ser His Ser Asn Gly Gln
210 215 220

Val Gln Val Trp Asp Ile Lys Gln Tyr Ser His Glu Asn Pro Ile Ile
225 230 235 240



Asp Leu Pro Leu Val Ser Ile Asn Ser Asp Gly Thr Ala Val Asn Asp
245 250 255



Val Thr Trp Met Pro Thr His Asp Ser Leu Phe Ala Ala Cys Thr Glu
260 265 270

Gly Asn Ala Val Ser Leu Leu Asp Leu Arg Thr Lys Lys Glu Lys Leu
275 280 285

Gln Ser Asn Arg Glu Lys His Asp Gly Gly Val Asn Ser Cys Arg Phe
290 295 300

Asn Tyr Lys Asn Ser Leu Ile Leu Ala Ser Ala Asp Ser Asn Gly Arg
305 310 315 320

Leu Asn Leu Trp Asp Ile Arg Asn Met Asn Lys Ser Pro Ile Ala Thr
325 330 335

Met Glu His Gly Thr Ser Val Ser Thr Leu Glu Trp Ser Pro Asn Phe
340 345 350

Asp Thr Val Leu Ala Thr Ala Gly Gln Glu Asp Gly Leu Val Lys Leu
355 360 365

Trp Asp Thr Ser Cys Glu Glu Thr Ile Phe Thr His Gly Gly His Met
370 375 380

Leu Gly Val Asn Asp Ile Ser Trp Asp Ala His Asp Pro Trp Leu Met
385 390 395 400

Cys Ser Val Ala Asn Asp Asn Ser Val His Ile Trp Lys Pro Ala Gly
405 410 415

Asn Leu Val Gly His Ser
420

(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 576 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: Periodic Trp protein, Fig. 39

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:

Met Ile Ser Ala Thr Asn Trp Val Pro Arg Gly Phe Ser Ser Glu Phe
1 5 10 15

Pro Glu Lys Tyr Val Leu Asp Asp Glu Glu Val Glu Arg Ile Asn Gln
20 25 30

Leu Ala Gln Leu Asn Leu Asp Asp Ala Lys Ala Thr Leu Glu Glu Ala
35 40 45

Glu Gly Glu Ser Gly Val Glu Asp Asp Ala Ala Thr Gly Ser Ser Asn
50 55 60

Lys Leu Lys Asp Gln Leu Asp Ile Asp Asp Asp Leu Lys Glu Tyr Asn
65 70 75 80

Leu Glu Glu Tyr Asp Asp Glu Glu Ile Ala Asp Asn Glu Gly Gly Lys
85 90 95

Asp Val Ser Met Phe Pro Gly Leu Ser Asn Asp Ser Asp Val Lys Phe
100 105 110

His Glu Gly Glu Lys Gly Glu Asp Pro Tyr Ile Ser Leu Pro Asn Gln
115 120 125

Glu Asp Ser Gln Glu Glu Lys Gln Glu Leu Gln Val Tyr Pro Ser Asp
130 135 140

Asn Leu Val Leu Ala Ala Arg Thr Glu Asp Asp Val Ser Tyr Leu Asp
145 150 155 160

Ile Tyr Val Tyr Asp Asp Gly Ala Gly Phe His Ser Ser Asp Ile Pro
165 170 175

Val Glu Glu Gly Asp Glu Ala Asp Pro Asp Val Ala Arg Gly Leu Val
180 185 190

;..rg Asp ?:--) All Lou Tyr */al >:is :-:i3 .-zy Leu ::3t L^u Fro Ala Phe 
195 200 205

Pro Leu Cys Val Glu Trp Leu Asp Tyr Lys Val Gly Ser Asn Ser Glu
210 215 220

Glu Ala Ala Asn Tyr Ala Ala Ile Gly Thr Phe Asp Pro Gln Ile Glu
225 230 235 240

Ile Trp Asn Leu Asp Cys Val Asp Lys Ala Phe Pro Asp Met Ile Leu
245 250 255

Gly Glu Pro Leu Asp Asn Ser Met Val Ser Leu Lys Ser Lys Lys Lys
260 265 270

Lys Lys Lys Ser Lys Thr Gly His Ile Thr Thr His His Thr Asp Ala
275 280 285

Val Leu Ser Met Ala His Asn Lys Tyr Phe Arg Ser Val Leu Ala Ser
290 295 300

Thr Ser Ala Asp His Thr Val Lys Leu Trp Asp Leu Asn Ser Gly Asn
305 310 315 320

Ala Ala Arg Ser Leu Ala Ser Ile His Ser Asn Lys Asn Val Ser Ser
325 330 335

Ser Glu Trp His Met Leu Asn Gly Ser Ile Leu Leu Thr Gly Gly Tyr
340 345 350

Asp Ser Arg Val Ala Leu Thr Asp Val Arg Ile Ser Asp Glu Ser Gln
355 360 365

Met Ser Lys Tyr Trp Ser Ala Met Ala Gly Glu Glu Ile Glu Thr Val
370 375 380

Thr Phe Ala Ser Glu Asn Ile Ile Leu Cys Gly Thr Asp Ser Gly Asn
385 390 395 400

Val Tyr Ser Phe Asp Ile Arg Asn Asn Glu Asn Arg Lys Pro Val Trp
405 410 415

Thr Leu Lys Ala His Asp Ala Gly Ile Ser Thr Leu Cys Ser Asn Lys
420 425 430



Phe Ile Pro Gly Met Met Ser Thr Gly Ala Met Gly Glu Lys Thr Val
435 440 445



Lys Leu Trp Lys Phe Pro Leu Asp Asp Ala Thr Asn Thr Lys Gly
450 455 460

Ser Met Val Leu Ser Arg Asp Phe Asp Val Gly Asn Val Leu Thr Ser
465 470 475 480

Ser Phe Ala Pro Asp Ile Glu Val Ala Gly Thr Met Val Ile Gly Gly
485 490 495

Val Asn Lys Val Leu Lys Leu Trp Asp Val Phe Thr Asn Arg Ser Val
500 505 510

Arg Lys Ser Phe Lys Ser Glu Leu Glu Asn Val Gln Ala Arg Ala Lys
515 520 525

Glu Glu Ala Gln Lys Ile Gly Lys Ser Ser Arg Ile Ala Arg Lys Tyr
530 535 540

Thr Ser Asn Asp Asn Pro Asp Thr Val Ile Thr Ile Asp Asp Gln Gly
545 550 555 560

Glu Asp Glu Glu Glu Arg Glu Gly Gly Asp Glu His Asp Asp Met Ala
565 570 575

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 325 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: PLAP, Fig. 40

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:

Met His Tyr Met Ser Gly His Ser Asn Phe Val Ser Tyr Val Cys Ile
1 5 10 15

Ile Pro Ser Ser Asp Ile Tyr Pro His Gly Leu Ile Ala Thr Gly .l;
20 25 30

Asn Asp His Asn Ile Cys Ile Phe Ser Leu Asp Ser Pro Met Pro Leu
35 40 45

Tyr Ile Leu Lys Gly His Lys Asp Thr Val Cys Ser Leu Ser Ser Gly
50 55 60

Lys Phe Gly Thr Leu Leu Ser Gly Ser Trp Asp Thr Thr Ala Lys Val
65 70 75 80

Trp Leu Asn Asp Lys Cys Met Met Thr Leu Gln Gly His Thr Ala Ala
85 90 95

Val Trp Ala Val Lys Ile Leu Pro Glu Gln Gly Leu Met Leu Thr Gly
100 105 110

Ser Ala Asp Lys Thr Ile Lys Leu Trp Lys Ala Gly Arg Cys Glu Arg
115 120 125

Thr Phe Leu Gly His Glu Asp Cys Val Arg Gly Leu Ala Ile Leu Ser
130 135 140

Glu Thr Glu Phe Leu Ser Cys Ala Asn Asp Ala Ser Ile Arg Arg Trp
145 150 155 160

Gln Ile Thr Gly Glu Cys Leu Glu Val Tyr Phe Gly His Thr Asn Tyr
165 170 175

Ile Tyr Ser Ile Ser Val Phe Pro Asn Ser Lys Asp Phe Val Thr Thr
180 185 190

Ala Glu Asp Arg Ser Leu Arg Ile Trp Lys His Gly Glu Cys Ala Gln
195 200 205

Thr Ile Arg Leu Pro Ala Gln Ser Ile Trp Cys Cys Cys Val Leu Glu
210 215 220

Asn Gly Asp Ile Val Val Gly Ala Ser Asp Gly Ile Ile Arg Val Phe
225 230 235 240

Thr Glu Ser Glu Glu Arg Thr Ala Ser Ala Glu Glu Ile Lys Ala Ser
245 250 255

Leu Ser Arg Glu Ser Pro Leu Ile Ala Lys Val Leu Thr Thr Glu
260 265 270

Pro Ile Ile Thr Pro Val Arg Arg Thr Leu Pro Cys Arg Val Thr Arg
275 280 285

Ser Met Ile Ser Ser Cys Leu Ser Arg Leu Val Ser Thr Ser Leu Ser
290 295 300

Thr Ser Asp Ser His Leu Thr Ile Thr Ala Leu His Leu Phe Leu Thr
305 310 315 320

Thr Thr Thr Thr Glu
325

(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 425 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: RETINOBLASTOMA BINDING PROTEIN - HUMAN, Fig. 41

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

Met Ala Asp Lys Glu Ala Ala Phe Asp Asp Ala Val Glu Glu Arg Val
1 5 10 15

Ile Asn Glu Glu Tyr Lys Ile Trp Lys Lys Asn Thr Pro Phe Leu Tyr
20 25 30

Asp Leu Val Met Thr His Ala Leu Glu Trp Pro Ser Leu Thr Ala Gln
35 40 45

Trp Leu Pro Asp Val Thr Arg Pro Glu Gly Lys Asp Phe Ser Ile His
50 55 60

Arg Leu Val Leu Gly Thr His Thr Ser Asp Glu Gln Asn His Leu Val
65 70 75 80

Ile Ala Ser Val Gln Leu Pro Asn Asp Asp Ala Gln Phe Asp Ala Ser
85 90 95

His Tyr Asp Ser Glu Lys Gly Glu Phe Gly Gly Phe Gly Ser Val Ser
100 105 110

Gly Lys Ile Glu Ile Glu Ile Lys Ile Asn His Glu Gly Glu Val Asn
115 120 125

Arg Ala Arg Tyr Met Pro Gln Asn Pro Cys Ile Ile Ala Thr Lys Thr
130 135 140



10 



Pro Ser Ser Asp Val Leu Val Phe Asp Tyr Thr Lys His Pro Ser Lys 
145 150 155 160 



15 



Pro Asp Pro Ser Gly Glu Cys Asn Pro Asp Leu Arg Leu Arg Gly His 
165 170 175 

Gin Lys Glu Gly Tyr Gly Leu Ser Trp Asn Pro Asn Leu Ser Gly His 
180 185 190 



20 



Leu Leu Ser Ala Ser Asp Asp His Thr He Cys Leu Trp Asp He Ser 
195 200 205 



Ala Val Pro Lys Glu Gly Lys Val Val Asp Ala Lys Thr He Phe Thr 
210 215 220 



25 



Gly His Thr Ala Val Val Glu Asp Val Ser Trp His Leu Leu His Glu 
225 230 235 240 



30 



Ser Leu Phe Gly Ser Val Ala Asp Asp Gin Lys Leu Met He Trp Asp 
245 250 255 

Thr Arg Ser Asn Asn Thr Ser Lys Pro Ser His Ser Val Asp Ala His 
260 265 270 



35 



Thr Ala Glu Val Asn Cys Leu Ser Phe Asn Pro Tyr Ser Glu Phe He 
275 280 285 



Leu Ala Thr Gly Ser Ala Asp Lys Thr Val Ala Leu Tip Asp Leu Arg 
-^50 295 300 



40 



Asn Leu Lys Leu Lys Leu His Ser Phe Glu Ser His Lys Asp Glu I : : 
305 310 315 320 



45 



Phe Gin Val Gin Trp Ser Pro His Asn Glu Thr He Leu Ala Ser Ser 
325 330 335 

Gly Thr Asp Arg Arg Leu Asn Val Trp Asp Leu Ser Lys He Gly Glu 
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340 345 350 

Glu Gin Ser Pro Glu Asp Ala Glu Asp Gly Pro Pro Glu Leu Leu Phe 
355 360 365 

5 

lie His Gly Gly His Thr Ala Lys lie Ser Asp Phe Ser Trp Asn Pro 
370 375 ^ 380 

Asn Glu Pro Trp Val lie Cys Ser Val Ser Glu Asp Asn lie Met Gin 
10 385 390 395 400 

Val Trp Gin Met Ala Glu Asn He Tyr Asn Asp Glu Asp Pro Glu Gly 
405 410 415 

15 Ser Val Asp Pro Glu Gly Gin Gly Ser 

420 425 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 852 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: S253 PROTEIN, Fig. 42

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

Met Phe Lys Ser Lys Thr Ser Thr Leu Ser Tyr Asp Glu Thr Pro Asn
1 5 10 15

Ser Asn Glu Gly Asp Arg Asn Ala Thr Pro Val Asn Pro Lys C-1 .
20 25 30

Ser Gln Thr Lys His Leu Asn Ile Pro Gly Asp Arg Ser Arg His Ser
35 40 45

Ser Ile Ala Asp Ser Lys Arg Ser Ser Ser Arg Tyr Asp Gly Gly Tyr
50 55 60

Ser Ala Asp Ile Ile Pro Ala Gln Leu Arg Phe Ile Asp Asn Ile Asp
65 70 75 80

Tyr Gly Thr Arg Leu Arg Lys Thr Leu His Arg Asn Ser Val Val Ser
85 90 95

Asn Gly Tyr Asn Lys Leu Ser Glu Asn Asp Arg Trp Tyr Phe Asp Leu
100 105 110

Phe Asp Arg Lys Tyr Phe Glu Asn Tyr Leu Glu Glu Pro Thr Tyr Ile
115 120 125

Lys Ile Phe Lys Lys Lys Glu Gly Leu Glu Gln Phe Asp Arg Met Phe
130 135 140

Leu Ala Gln Glu Leu Lys Ile Pro Asp Val Tyr Lys Ser Thr Thr Tyr
145 150 155 160

Gln Gly Glu Pro Ala Val Ala Asn Ser Glu Leu Phe Lys Asn Ser Ile
165 170 175

Cys Cys Cys Thr Phe Ser His Asp Gly Lys Tyr Met Val Ile Gly Cys
180 185 190

Lys Asp Gly Ser Leu His Leu Trp Lys Val Ile Asn Ser Pro Val Lys
195 200 205

Arg Ser Glu Met Gly Arg Ser Glu Lys Ser Val Ser Ala Ser Arg Ala
210 215 220

Asn Ser Leu Lys Ile Gln Arg His Leu Ala Ser Ile Ser Ser His Asn
225 230 235 240

Gly Ser Ile Ser Ser Asn Asp Leu Lys Pro Ser Asp Gln Phe Glu Gly
245 250 255

Pro Ser Lys :;ir. luu .2 Tyr .: v.- Asp Val
260 265 270

Phe Arg Val Phe Met Glu His Ala Leu Asp Ile Leu Asp Ala Asn Vi-p
275 280 285



Ser Lys Asn Gly Phe Leu Ile Thr Ala Ser Met Asp Lys Thr Ala Lys
290 295 300

Trp His Pro Glu Arg Lys Tyr Ser Leu Lys Thr Phe Val His Pro
305 310 315 320

Asp Phe Val Thr Ser Ala Ile Phe Phe Pro Asn Asp Asp Arg Phe Ile
325 330 335

Ile Thr Gly Cys Leu Asp His Arg Cys Arg Leu Trp Ser Ile Leu Asp
340 345 350

Asn Glu Val Ser Tyr Ala Phe Asp Cys Lys Asp Leu Ile Thr Ser Leu
355 360 365

Thr Leu Ser Pro Pro Gly Gly Glu Tyr Thr Ile Ile Gly Thr Phe Asn
370 375 380

Gly Tyr Ile Tyr Val Leu Leu Thr His Gly Leu Lys Phe Val Ser Ser
385 390 395 400

Phe His Val Ser Asp Lys Ser Thr Gln Gly Thr Thr Lys Asn Ser Phe
405 410 415

His Pro Ser Ser Glu Tyr Gly Lys Val Gln His Gly Pro Arg Ile Thr
420 425 430

Gly Leu Gln Cys Phe Phe Ser Lys Val Asp Lys Asn Leu Arg Leu Ile
435 440 445

Val Thr Thr Asn Asp Ser Lys Ile Gln Ile Phe Asp Leu Asn Glu Lys
450 455 460

Lys Pro Leu Glu Leu Phe Lys Gly Phe Gln Ser Gly Ser Ser Arg His
465 470 475 480

Arg Gly Gln Phe Leu Met Met Lys Asn Glu Pro Val Val Phe Thr Gly
485 490 495

Ser Asp Asp His Trp Phe Tyr Thr Trp Lys Met Gln Ser Phe Asn Leu
500 505 510

Ser Ala Glu Met Asn Cys Thr Ala Pro His Arg Lys Lys Arg Leu o^r
515 520 525

Gly Ser Met Ser Leu Lys Gly Leu Leu Arg Ile Val Ser Asn Lys Ser
530 535 540

Thr Asn Asp Glu Cys Leu Thr Glu Thr Ser Asn Gln Ser Ser Ser His
545 550 555 560



Thr Phe Thr Asn Ser Ser Lys Asn Val Leu Gln Thr Gln Thr Val Gly
565 570 575

Ser Gln Ala Ile Lys Asn Asn His Tyr Ile Ser Phe His Ala His Asn
580 585 590

Ser Pro Val Thr Cys Ala Ser Ile Ala Pro Asp Val Ala Ile Lys Asn
595 600 605

Leu Ser Leu Ser Asn Asp Leu Ile Phe Glu Leu Thr Ser Gln Tyr Phe
610 615 620

Lys Glu Met Gly Gln Asn Tyr Ser Glu Ser Lys Glu Thr Cys Asp Asn
625 630 635 640

Lys Pro Asn His Pro Val Thr Glu Thr Gly Gly Phe Ser Ser Asn Leu
645 650 655

Ser Asn Val Val Asn Asn Val Gly Thr Ile Leu Ile Thr Thr Asp Ser
660 665 670

Gln Gly Leu Ile Arg Val Phe Arg Thr Asp Ile Leu Pro Glu Ile Arg
675 680 685

Lys Lys Ile Ile Glu Lys Phe His Glu Tyr Asn Leu Phe His Leu Glu
690 695 700

Ala Ala Gly Lys Ile Asn Asn His Asn Asn Asp Ser Ile Leu Glu Asn
705 710 715 720

Arg Met Asp Glu Arg Ser Ser Thr Glu Asp Asn Glu Phe Ser Thr Thr
725 730 735

Pro Pro Ser Asn Thr His Asn Ser Arg Pro Ser His Asp Phe Cys Glu
740 745 750

Leu Hi. -Jro v.: ri. ser Gly r-T.t Arg Ala
755 760

Ser Ala Ile Phe Lys Asn Ser Ile Phe Asn Lys Ser Asn Gly Ser Phe
770 775 780

Ile Ser Leu Lys Ser Arg Ser Glu Ser Thr Ser Ser Thr Val Phe Gly
785 790 795 800
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Secondary structure analysis identifies a 
putative mouse protein demonstrating 
similarity to the repeat units found in CDC4, 
the G protein β subunits and related
proteins 
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The predicted protein product of an anonymous clone isolated 
from a cDNA library prepared from 12 day post coitum (p.c.)
embryonic mouse heart tissue demonstrated the same segmental
repeats previously identified in the cell division control protein,
CDC4, and the G protein β subunit. A search of the protein database
subsequently identified three other classes of protein containing
the repeat. Secondary structure analyses performed on
the repeat sequences revealed a high degree of conservation,
suggesting that the repeat motif performs a specific function in a
diverse range of proteins.

KEY WORDS: G protein, mouse heart protein, repeat homology, 
secondary structure 



INTRODUCTION 

A study of genes expressed during early mouse 
heart development generated data for several novel 

anonymous sequences. The function of one of 
these genes was investigated using comparative 
and predictive methods. 

The similarity between an anonymous region of 



DNA and sequence submitted to the databases is
often too low to be informative. In this case, the
DNA can be translated and the amino acid
sequence screened against a protein sequence
database using a best local alignment programme
such as FASTP (Lipman and Pearson, 1985). It is
possible for two protein sequences to have
diverged yet still retain the same function if residues
with similar physicochemical and spatial
properties have been substituted. To account for
this, a comparison between protein sequences is
often assessed in terms of exact and conserved
matches, which increases the probability that
related or similar proteins will be identified at the
protein level when a comparison of the DNA
sequence is uninformative. Furthermore, amino
acid composition is not random and certain residues
(such as leucine) occur far more frequently
than the least abundant residues (such as
tryptophan) (Doolittle, 1986). This bias in codon
composition is therefore informative when
assessing the significance of matched (or
mismatched) residues.

Similarities at the primary amino acid level are
often a reflection of secondary structure conservation,
although exceptions have been identified
(Wilson et al., 1985). In the absence of X-ray
crystallography studies, secondary structure predictions
have been produced either from stereochemical
and physical considerations of amino acid
sequence data (Chou and Fasman, 1974; Garnier et
al., 1978; Lim, 1974), or by extrapolation from
sequences of known secondary structure (Levin et
al., 1986). The most commonly used methods are
statistical, that is, they are based on the observed
frequency with which individual residues are
found in given structural states. The overall accuracy
of these methods has been assessed as
50-59% (Kabsch and Sander, 1983) and structural
predictions produced by these methods should
therefore be assessed with caution.
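
The statistical idea behind such methods can be illustrated with a minimal sketch: the propensity of a residue for a structural state is its observed frequency in that state relative to the overall frequency of that state. The function name, the toy training data and the state labels (H for helix, E for strand) are hypothetical and are not taken from any of the cited methods.

```python
from collections import Counter


def state_propensities(sequences, states):
    """Return {(residue, state): propensity} from paired sequence/state strings.

    propensity = [n(residue, state) / n(residue)] / [n(state) / N]
    """
    pair_counts, residue_counts, state_counts = Counter(), Counter(), Counter()
    for seq, st in zip(sequences, states):
        if len(seq) != len(st):
            raise ValueError("sequence and state strings must have equal length")
        for residue, state in zip(seq, st):
            pair_counts[(residue, state)] += 1
            residue_counts[residue] += 1
            state_counts[state] += 1
    total = sum(residue_counts.values())
    propensities = {}
    for residue in residue_counts:
        for state in state_counts:
            observed = pair_counts[(residue, state)] / residue_counts[residue]
            expected = state_counts[state] / total
            propensities[(residue, state)] = observed / expected
    return propensities


# Hypothetical toy data: two alanines in helix, two valines in strand.
toy = state_propensities(["AAVV"], ["HHEE"])
print(toy[("A", "H")], toy[("V", "H")])
```

A propensity above 1 means the residue is found in that state more often than expected by chance. Real predictors derive these values from large sets of solved structures and combine them over windows of neighbouring residues.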

Predictions of α helix structure combined with
hydrophobicity or charge distribution plots can be
represented as a helical wheel. The helical wheel
can be used to demonstrate the clustering of similar
residues around the helix as the 3.6 residue
pitch of the helix brings residues at positions a, a±1
and a±4 together. One particular application is in
the identification of amphipathic helices. Thus the
assignment of similar residues to specific sides or
faces of the helix can reveal associations that may
not have been obvious from the linear amino acid
sequence.
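
A minimal sketch of the helical wheel projection, assuming an ideal α helix with 3.6 residues per turn (100° per residue). The sequence is the insert named in the Figure 5 caption; the function names and the set of residues treated as hydrophobic are illustrative assumptions.

```python
DEGREES_PER_RESIDUE = 100.0    # 360 degrees / 3.6 residues per turn
HYDROPHOBIC = set("AVLIMFWC")  # assumed classification, for illustration only


def helical_wheel(sequence):
    """Return (position, residue, angle in degrees) for each residue."""
    return [(i + 1, aa, (i * DEGREES_PER_RESIDUE) % 360.0)
            for i, aa in enumerate(sequence)]


def angular_separation(a, b):
    """Smallest angle between two wheel positions, in degrees."""
    diff = abs(a - b) % 360.0
    return min(diff, 360.0 - diff)


wheel = helical_wheel("LWVTKVVLQ")
for position, aa, angle in wheel:
    tag = "hydrophobic" if aa in HYDROPHOBIC else "polar/charged"
    print(f"{position:2d} {aa} {angle:5.1f} {tag}")
```

Residues whose angles fall within a narrow arc lie on the same face of the helix, so listing the angles alongside a hydrophobicity label gives a quick check for an amphipathic arrangement.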

The use of comparative methods to elucidate the
function of an anonymous sequence requires cautious
interpretation of computer generated results.
Sequence analysis has, however, been successfully
employed to elucidate the nature of a number of
proteins, the functions of which have subsequently
been confirmed by biochemical and genetic analyses
(reviewed by Hodgman, 1986).



RESULTS 

Northern blot analysis using the full length cDNA
of phage clone 1 as a probe identified a single transcript
of approximately 1.8 kb in mRNA isolated
from the 12-day p.c. mouse heart preparation (data
not shown). Clone 1 was sequenced in both the
sense and antisense orientations. The presence of a
poly (A) tail at the 3' terminus and the high percentage
of G/C nucleotides at the 5' end, together
with the identification of a single transcript of
equivalent length, suggested that the clone was full
length. The sequence was translated in all three
reading frames from the first AUG (methionine)
triplet in each frame to the first stop codon. The
longest open reading frame (1.2 kb) coded for a
protein of 422 amino acids (Fig. 1). Codon preference
analysis (data not shown) confirmed that this
was the correct open reading frame. A possible
frameshift was indicated very close to the C-terminal,
although this was not confirmed in the DNA
sequence data from either the plus or minus strand.
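
The translation step described above can be sketched as follows, assuming the standard genetic code and forward-strand frames only. The function names and the short test sequence are hypothetical; an open reading frame is reported only when an in-frame stop codon is found.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built from the conventional TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def orf_in_frame(dna, frame):
    """Translate from the first ATG in the given frame (0, 1 or 2) to the first stop.

    Returns one-letter amino acid codes, or "" if the frame has no ATG
    followed by an in-frame stop codon.
    """
    dna = dna.upper().replace("U", "T")
    protein = []
    for i in range(frame, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if not protein:
            if codon == "ATG":
                protein.append("M")
            continue
        amino_acid = CODON_TABLE.get(codon, "X")  # X marks ambiguous codons
        if amino_acid == "*":
            return "".join(protein)
        protein.append(amino_acid)
    return ""  # no start codon, or no stop codon before the end of the sequence


def longest_orf(dna):
    """Return (frame, protein) for the longest ORF over the three forward frames."""
    return max(((f, orf_in_frame(dna, f)) for f in range(3)),
               key=lambda item: len(item[1]))


print(longest_orf("CCATGGAGAGATGACC"))
```

Choosing the longest frame is only a first pass; as the text notes, codon preference analysis was used to confirm that the selected frame was the coding one.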

DNA sequence data was screened against the
Genbank and EMBL databases. Both the complete
cDNA sequence and the DNA sequence of the
coding region failed to show significant similarity
to DNA sequences submitted to the databases. The
predicted protein sequence was then screened
against the PIR database. Overall, the protein
showed low similarity to protein sequences submitted
to the database. One region, however, was
similar to a region present in a yeast cell division
control protein (CDC4), the three known β subunits
of the guanine nucleotide-binding protein (G
protein) complex (Fong et al., 1986, 1987; Levine
et al., 1990), an AAC rich mRNA fragment isolated
from D. discoideum (AAC3) (Shaw et al., 1989)
and the S. cerevisiae protein, TUP1 (Williams and
Trumbly, 1990).

Pairwise alignments were performed between
clone 1 and sequences identified by the database
search. The alignment between clone 1 and CDC4
(Fig. 2) produced the greatest number of exact
matches (29%). This figure increased to 79% when
both conserved and exact matches were taken into
account. The three β subunits and the AAC3 protein
shared 17-19% exact residues with clone 1
(data not shown). This figure increased to greater
than 50% when both conserved and exact matches
were taken into account. Dot matrix comparisons
were made of the human β2 subunit (Fong et al.,
1987) both with itself and with the clone 1
sequence (Fig. 3). Internal repeats within the clone
1 sequence were identified by a comparison of the
sequence with itself. The repeat units in clone 1
were subsequently identified and aligned together
with the repeats from AAC3, CDC4 and the human
G protein β2 subunit. The β1 sequence was not
included because it was virtually identical to β2.
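
The exact and conserved match percentages quoted above can be computed from a gapped pairwise alignment along the following lines. This is a sketch only: the text does not state which conservative-substitution groups were used, so the groups below, the function name and the example fragments are assumptions.

```python
# Assumed conservative-substitution groups; the source does not state which
# grouping was used, so these are for illustration only.
CONSERVED_GROUPS = ("ILVM", "FYW", "KRH", "DE", "NQ", "ST", "AG")


def match_percentages(seq_a, seq_b, gap="-"):
    """Return (% exact, % exact + conserved) over ungapped aligned columns."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = exact = conserved = 0
    for x, y in zip(seq_a.upper(), seq_b.upper()):
        if x == gap or y == gap:
            continue  # columns containing a gap are not scored
        compared += 1
        if x == y:
            exact += 1
        elif any(x in group and y in group for group in CONSERVED_GROUPS):
            conserved += 1
    if compared == 0:
        return 0.0, 0.0
    return 100.0 * exact / compared, 100.0 * (exact + conserved) / compared


# Hypothetical aligned fragments, not taken from the figures.
print(match_percentages("LVTGSFDN", "IVTGSRDN"))
```

The second figure depends heavily on how generously the groups are drawn, which is one reason conserved-match percentages from different studies are hard to compare directly.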

Secondary structure predictions were performed
on several of the repeat sequences; the results are
shown in Fig. 4. Some of the β-strands may extend
for a residue longer than shown. All other regions
were predicted to form β-turns or unstructured coil
except the sequence insertion in repeat 4 of clone
1, which scored as helix. This inserted region was
represented as a helical wheel (Fig. 5). The
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TGGCTGTGGAGCGGACCCGGCCGCTGCGACGCTCTGGCGGCCCGAGCGCGCCTAGTCGGTGTGAGCCCGGCGCGAG 
GTCCCGGGCCCCGGGGCGCTCGCTCAGGTAATATTTCCATAACCTT 

ATG GAG AGA AAG GAC TTT GAG ACA TCG CTT GAT AAC ATT TCT CTT ACA TTT CTT TCT CTC ATG GAC TTC 

MERKDFETWLDNISVTFLSLMDL 
1 

GAG AAA AAT GAA ACT CTG GAC CAC CTG ATT AGT CTG AGT GGG GCA CTC CAG CTC ACG CAT CTC TCC AAT 
QKNETLDHLISLSCAVOLRHLSN 

AAC CTG CAG ACT CTC CTC AAC CGG GAC TTC CTC AAA CTC CTT CCC CTG GAG CTC AGT TTT TAT TTG TTA 

NLETLLKROFLKLLPLELSFYLL 
SO 

AAA TGG CTC GAT CCT CAG ACT TTA CTC ACA TGC TGC CTC CTC TCT AAG CAG CGG AAT AAG CTC ATA ACT 
KWLDPQTLLTCCLVSKORNKVIS 

GCC TGT ACA CAG CTG TGG CAC ACT GCA TCT AAA AAT TTC CCC TCG CAC ATA CAT CAT TCT CTT CAC GAC 

'^CTEVWOTACKNLCWQIDDSVOD 

100 

TCA TTG CAC TGC AAG AAC GTT TAT TTG AAC CCT ATT TTG AGG (^TG AAG CAA CTG CAG GAC CAT GAA GCC 
SLHWKKVYLKAILRMKOLEDHEA 

TTT CAG ACC TCT TCC TTA ATT CCA CAT AGT GCC AGA CTG TAT GCA CTT TAC TAC AAA GAT GCA CTT CTC 

^ETSSLICHSARVYALYYKDCLL 

150 

TGT ACA GCC TCA GAT CAC TTC TCT CCA AAC CTG TGG GAT GTA ACC ACA CCC CAG TGT CTT TAC CCC ATC 
C^TGSDDl. SAKLWOVSTCQCVYCI 

CAC ACC CAC ACT TGT GCA GCT GTG AAC TTC GAT GAA CAG AAG CTT GTG ACA CCC TCC TTT GAC AAC ACT 
OTHTCAAVKFDEQKLVTGSFDNT 

200 

CTC GCC TGC TGC GAC TGG AGT TCC GGA GCC AG.G ACC CAG CAC TTC CGG CCC CAC ACC CGG CCC CTG TTC 
VACWEWSSCARTQHFRGHTGAVF 

AGT CTC CAC TAC AGT CAT GAA CTC CAT ATT TTC GTG ACT GCC TCT CCC CAC TTC CCT CTG AAA GTA TGG 
SVDYSDELDI LVSCSADFAVKVW 

250 

GCT TTA TCT CCT GGG ACA TGC CTG AAT ACA CTC ACT GGG CAT ACT GAA TGG CTC ACC AAC CTC CTT TTC 
'^I'SAGTC LNTLTGHTEWVTKVVL 

CAG AAG TGC AAA GTC AAG TCT CTC TTG CAC ACC CCT GGA GAC TAC ATC CTC TTA AGT GCA GAC AAA TAT 
QKCKVKSLLHSPGDYILLSAOKY 

GAG ATC AAG ATT TGG CCA ATT CCC AGA GAA ATC AAC TGT AAC TGC TTG AAC ACA CTG TCT GTC TCT GAG 

EIKIWPICREINCKCLKTLSVSE 

300 

GAT AGA AGT ATC TGC CTG CAG CCA AGA CTT CAT TTT GAT GGA AAA TAC ATT CTC TCT AGT TCA CCC CTC 
DRSICLQPRLMFDGKYIVCSSAL 

GGT CTG TAC CAG TGG GAC TTT GCC AGT TAT GAT ATT CTC AGG GTC ATC AAG ACA CCT GAG GTA CCA AAC 

GLYQWDFASYDILRVIKTPEVAN 
350 

TTG CCC TTC CTT CGC TTT CCA GAT GTC TTC GCC CTC CTG TTT CAC AAC CAC TAC CTA TAT ATC ATG GAC 
LALLGFCDVFALLFDNHYLY IMD 

TTC AGG ACA GAC ACC CTA ATT ACC CCC TGG CCT CTG CCA CAG TAC AGG AAA TCA AAG ACA CGC TCC ACC 
I-RTESLI SRWPLPEYRKSKRCSS 

400 

TTC CTG GCA GCC GAA CGT CCT GGT 
FLAGERPG 

TGAATGGATTGGATGGGCACAATGACACGGGCTTAGTCTTTGCCACCAGCATGCCTGACCACAGTATTCACCTGGT 
GTTATGGAAGGAGCATTGCTGACACCAGGAGCTACCACCGCTGACTGACTTTGGGTGCCAGGGCTGCGGGTTTTGG 
GTGCAATGTCTATGGCAGCCAACTGCATGAACCAAAGTTCTCACCTAAAGGTATCATCACGCAGTGCACAATCATT 
TATCTGTTTGCCAGGGCTGGGGCGGGGAGGGCTTGTTTACTGACATACACCGCAGCATGCTAATGGGATACACCAT 
TGACTTCATTTGATCTTAGTTATGTTGGTCAGTGTAAGAGAGGTTGCATTTTTGGATTTATCTTTCTGAGTGGAAT 
ATTGAGTAAAGAAAGTTAAATGATTCACTAATCTGCCTAATTGGTTGCCCATAAAA 

Figure 1 Complete cDNA and predicted amino acid sequence of clone 1. The cDNA sequence was confirmed for both the plus and
minus strands. The cDNA sequence has been submitted to the EMBL database (accession number EMBL X54352). The open reading frame
was taken from the methionine residue (ATG) to the first in frame stop codon (TGA). The codon bias of the sequence confirmed that this
sequence represents the protein coding region. The deduced amino acid sequence of 422 residues is represented by the single letter code.
[Figure 2: alignment of clone 1 with CDC4 (yeast); alignment body illegible in the source scan]

Figure 5 Helical wheel representation of the insert sequence
LWVTKVVLQ from repeat 4 of clone 1. The positions of residues
in the sequence are indicated around the helix.



clustering of four hydrophobic residues (three val- 
ines and tryptophan) to one face suggests that the 
region is very strongly amphipathic. 

DISCUSSION 

Clone 1 demonstrated significant similarity to the
CDC4 yeast cell cycle regulator, to the β subunits
of the human and bovine G proteins (Fong et al.,
1986, 1987; Levine et al., 1990) and to the translated
product of the AAC3 gene (Shaw et al.,
1989).

Alignments between three known proteins and
clone 1 identified a repetitive segmental structure
that had previously been identified in the G protein
β1 subunit and the C-terminal region of CDC4
(Fong et al., 1986). The reported sequence consisted
of 86 residues composed of two 43 residue
segments. The smaller repeat unit was redefined
for secondary structure analysis. Although the primary
amino acid sequences had diverged considerably
between the proteins, the large number
of conserved substitutions was reflected in the conservation
of the predicted secondary structure. Any
functional similarity suggested by the sequence
conservation between the known genes is not
immediately apparent. CDC4 is required for
spindle pole body separation in the yeast mitotic
cell cycle as well as having a role in sporulation
(Yochem and Byers, 1987). The trimeric G protein
complex is involved in signal transduction. The
AAC containing transcripts appear to be developmentally
regulated during spore generation (Shaw
et al., 1989), although it is unknown whether the
AAC transcript is translated in vivo. The consensus
pattern was also found as a five or six copy repeat
in a protein variously known as TUP1 (Williams
and Trumbly, 1990) or SFL2 (Fujita et al., 1990),
which is thought to mediate glucose repression.
Later database searches also identified the repeat
sequence in a yeast protein (periodic tryptophan
protein, PWP1; Duronio et al., 1991) of unknown
function and in the protein product of a D. melanogaster
neurogenic gene known as enhancer of
split (Hartley et al., 1988). The significance of
these results is as yet unknown.

A highly amphipathic region was identified
between residues 268 and 279 of the translated
product of clone 1. The under-represented residue
tryptophan was located on the hydrophobic face of
the helical wheel, which increases the significance
of this region in terms of sequence conservation
and thus motif function. Amphipathic helical
domains have been observed in the biologically
active regions of small effector proteins, hormones
and signalling sequences (Adelman et al., 1986;
Masters et al., 1986; Vassarotti et al., 1987). Interestingly,
amphipathic helical regions have been
found in the receptor binding domains at the C-terminal
regions of G protein α subunits (Adelman et
al., 1986). The authors suggest that charge distributions
on the polar face of the helix may be
responsible for the specificity of the α subunits for
their receptors, as subunits with similar charge distributions
cross react with each other's receptors.
Thus the presence of an inserted region at the
C-terminus of the clone 1 protein may confer
specificity.

Within the repeat, the occurrence and relative
position of four residues (glycine, histidine,
aspartic acid and tryptophan) together with interspersed
hydrophobic residues suggested that a
range of diverse proteins from distantly related
species are similar. It is likely that the highly conserved
residues impart some feature that is critical
in maintaining the structure of the repeat motif.
The glycine residue found between the first two
regions of β-strand is reminiscent of those found in
some type 2 (inverse) β-turns (Sibanda and
Thornton, 1985). The substitution of serine for glycine
in repeat 5 of the β1 subunit is tolerated as
serine residues are also commonly found in tight
turns. Conserved histidines play a major role as
active site residues or as controllable elements in
conformational changes, though their role here is
uncertain. Negatively charged aspartate is a prominent
turn former which is commonly found as a
helix starter. Tryptophan is a large rigid hydrophobic
residue which is usually buried. The role of
the charged residue normally present at position
15 within this hydrophobic region is as yet unclear.
The terminal glycine residue may assist in forming
flexible links between one repeat and the next.

In most cases, deviation from the consensus 
sequence resulted in the substitution of similar resi- 
dues. The substitution of residues which are not 
obviously similar may indicate functional differ- 
ences or it may reflect a tertiary level constraint or 
some property of the residue that is specific to this 
particular environment. Although the accuracy of
secondary structure predictions has been assessed 
as less than 60% (Kabsch and Sander, 1983), the 
fact that several repeat motifs identified in different 
genes demonstrate a conserved structure indicates 
that the predictions are correct. 

Proteins are only said to be homologous if they
are descended from a common ancestor. It is often
difficult to predict whether sequence similarity is
the result of divergence from a common gene or
due to evolutionary convergence in protein
domains with similar functions. Fong et al. (1986)
consider that the CDC4 protein is homologous to
the β1 subunit of the G proteins due to the occurrence
and periodic repetition of the consensus
sequence even though the overall conservation of
residues is only approximately 19%. If Fong et al.
(1986) are correct in their prediction, it may be that
clone 1, AAC3, TUP1, PWP1, and enhancer of split
are also homologous. It is likely that the true
relationship will only become apparent from biochemical
analyses.



Generation of sequence data 

The EcoRI digested insert DNA was subcloned into the
complementary site of M13mp19 and recombinant phage
identified by X-gal selection. Recombinant M13 DNA was
prepared for sequencing by standard methods (Maniatis et al.,
1982). Sequence data from recombinant phage containing
inserts cloned in both orientations was generated using a
Sequenase 2.0 kit (USB) according to manufacturer's recommendations
and visualized by autoradiography following
electrophoresis through a 6% polyacrylamide gel.

Sequence analyses 

The cDNA sequence was first screened against the Genbank
database using the FASTA program (Wilbur and Lipman,
1983), then translated in all three frames and screened against
the PIR database using the FASTP program (Lipman and Pearson,
1985). Alignments between similar sequences were performed
using the CLUSTAL pairwise alignment program
(Higgins and Sharp, 1988). Internal repeat analysis and dot
matrix comparisons were performed using the DIAGON program
(Staden, 1982). Secondary structure predictions were
performed using the method of Garnier et al. (1978); the helical
wheel analysis was performed using the Analysep program
of Staden (1984).

Northern blot analysis 

1 μg of mRNA was electrophoresed under denaturing conditions,
then blotted overnight onto GeneScreen Plus
(DuPont). Hybridization was carried out overnight with the
oligolabelled cDNA probe under standard conditions
(Maniatis et al., 1982). Non-specifically bound probe was
removed by washing in 0.1×SSC, 1% SDS for 1 h at 65°C. The
membrane was then exposed to Fuji film at -70°C for three
days.
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MATERIALS AND METHODS 

Preparation of mRNA and construction of the library 

RNA was extracted from 12-day p.c. embryonic mouse hearts
by the method of Cathala et al. (1983). Poly A tailed RNA
was isolated using Poly A Quick columns (Stratagene) according
to manufacturer's recommendations. cDNA was synthesized
using a cDNA synthesis plus kit (Amersham) and the
library constructed in λgt10 by standard methods (Maniatis et
al., 1982). DNA from an individual plaque was prepared for
subcloning by standard methods (Maniatis et al., 1982).
Pro His Asp Ile Pro Arg Val Ser Thr Thr Tyr Pro Lys Leu Lys Cys
805 810 815

Asp Val Cys Asn Gly Ser Asn Phe Glu Cys Ala Ser Lys Asn Pro Ile
820 825 830

Ala Gly Gly Asp Ser Gly Phe Thr Cys Ala Asp Cys Gly Thr Ile Leu
835 840 845

Asn Asn Phe Arg
850



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 488 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: SOF1, Fig. 43

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:

Met Lys Ile Lys Thr Ile Lys Arg Ser Ala Asp Asp Tyr Val Pro Val
1 5 10 15

Lys Ser Thr Gln Glu Ser Gln Met Pro Arg Asn Leu Asn Pro Glu Leu
20 25 30

His Pro Phe Glu Arg Ala Arg Glu Tyr Thr Lys Ala Leu Asn Ala Thr
35 40 45

Lys Leu Glu Arg Met Phe Ala Lys Pro Phe Val Gly Gln Leu Gly Tyr
50 55 60

Gly His Arg Asp Gly Val Tyr Ala Ile Ala Lys Asn Tyr Gly Ser Leu
65 70 75 80

Asn Lys Leu Ala Thr Gly Ser Ala Asp Gly Val Ile Lys Tyr Trp Asn
85 90 95

Met Ser Thr Arg Glu Glu Phe Val Ser Phe Lys Ala His Tyr Gly Leu
100 105 110

Val Thr Gly Leu Cys Val Thr Gln Pro Arg Phe His Asp Lys Lys Pro
115 120 125

Asp Leu Lys Ser Gln Asn Phe Met Leu Ser Cys Ser Asp Asp Lys Thr
130 135 140

Val Lys Leu Trp Ser Ile Asn Val Asp Asp Tyr Ser Asn Lys Asn Ser
145 150 155 160

Ser Asp Asn Asp Ser Val Thr Asn Glu Glu Gly Leu Ile Arg Thr Phe
165 170 175

Asp Gly Glu Ser Ala Phe Gln Gly Ile Asp Ser His Arg Glu Asn Ser
180 185 190

Thr Phe Ala Thr Gly Gly Ala Lys Ile His Leu Trp Asp Val Asn Arg
195 200 205

Leu Lys Pro Val Ser Asp Leu Ser Trp Gly Ala Asp Asn Ile Thr Ser
210 215 220

Leu Lys Phe Asn Gln Asn Glu Thr Asp Ile Leu Ala Ser Thr Gly Ser
225 230 235 240

Asp Asn Ser Ile Val Leu Tyr Asp Leu Arg Thr Asn Ser Pro Thr Gln
245 250 255

Lys Ile Val Gln Thr Met Arg Thr Asn Ala Ile Cys Trp Asn Pro Met
260 265 270

Glu Ala Phe Asn Phe Val Thr Ala Asn Glu Asp His Asn Ala Tyr Tyr
275 280 285

Tyr Asp Met Arg Asn Leu Ser Arg Ser Leu Asn Val Phe Lys Asp :..
290 295 300

Val Ser Ala Val Met Asp Val Asp Phe Ser Pro Thr Gly Asp Glu Ile
305 310 315 320

Val Thr Gly Ser Tyr Asp Lys Ser Ile Arg Ile Tyr Lys Thr Asn His
325 330 335

Gly His Ser Arg Glu Ile Tyr His Thr Lys Arg Met Gln His Val Phe
340 345 350

Val Lys Tyr Ser Met Asp Ser Lys Tyr Ile Ile Ser Gly Ser Asp Asp
355 360 365

Gly Asn Val Arg Leu Trp Arg Ser Lys Ala Trp Glu Arg Ser Asn Val
370 375 380

Lys Thr Thr Arg Glu Lys Asn Lys Leu Glu Tyr Asp Glu Lys Leu Lys
385 390 395 400

Glu Arg Phe Arg His Met Pro Glu Ile Lys Arg Ile Ser Arg His Arg
405 410 415

His Val Pro Gln Val Ile Lys Lys Ala Gln Glu Ile Lys Asn Ile Glu
420 425 430

Leu Ser Ser Ile Lys Arg Arg Glu Ala Asn Glu Arg Arg Thr Arg Lys
435 440 445

Asp Met Pro Tyr Ile Ser Glu Arg Lys Lys Gln Ile Val Gly Thr Val
450 455 460

His Lys Tyr Glu Asp Ser Gly Arg Asp Arg Lys Arg Arg Lys Glu Asp
465 470 475 480

Asp Lys Arg Asp Thr Gln Glu Lys
485

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 423 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: STE4 - YEAST, Fig. 44

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:

Met Ala Ala His 61n Met Asp Ser lie Thr Tyr Ser Asn Asn Val Thr 
Gin Gin Tyr He Gin Prd Gin Ser Leu Gin Asp He Ser Ala Val 



10 20 25 



Glu 
30 



Asp Glu He Gin Asn Lys He Glu Ala Ala Arg Gin Glu Ser Lys Gin 
35 40 45 

Leu His Ala Gin He Asn Lys Ala Lys His Lys He Gin Asp Ala Ser 
50 55 60 

Leu Phe Gin Met Ala Asn Lys Val Thr Ser Leu Thr Lys Asn Lys He 
" '° 75 80 

Asn Leu Lys Pro Asn He Val Leu Lys Gly His Asn Asn Lys He Ser 



85 90 



95 



Asp Phe Arg Trp Ser Arg Asp Ser Lys Arg He Leu Ser Ala Ser Gin 



100 



105 



110 



Asp Gly Phe Met Leu He Trp Asp Ser Ala Ser Gly Leu Lys Gin Asn 
lis 120 

Ala He Pro Leu Asp Ser Gin Trp Val Leu Ser Cys Ala He Ser Pro 
130 135 140 

ser ser Thr Leu Val Ala Ser Ala Gly Leu Asn Asn Asn Cys Thr He 



"5 150 1^55 



160 



Tyr Arg Val Ser Lys Glu Asn Arg Vai Ala Gin Asn Val Ala Ser He 
165 170 

:-ho Lys Gly His :-hr Cys Tyr He S.r .--p He Glu rh. A=o Asn 
180 185 190 

Ala His Ile Leu Thr Ala Ser Gly Asp Met Thr Cys Ala Leu Trp Asp
195 200 205

Ile Pro Lys Ala Lys Arg Val Arg Glu Tyr Ser Asp His Leu Gly Asp
210 215 220



Val Leu Ala Leu Ala Ile Pro Glu Glu Pro Asn Leu Glu Asn Ser Ser
230 235 240

Asn Thr Phe Ala Ser Cys Gly Ser Asp Gly Tyr Thr Tyr Ile Trp Asp
245 250 255

Ser Arg Ser Pro Ser Ala Val Gln Ser Phe Tyr Val Asn Asp Ser Asp
260 265 270

Ile Asn Ala Leu Arg Phe Phe Lys Asp Gly Met Ser Ile Val Ala Gly
275 280 285

Ser Asp Asn Gly Ala Ile Asn Met Tyr Asp Leu Arg Ser Asp Cys Ser
290 295 300

Ile Ala Thr Phe Ser Leu Phe Arg Gly Tyr Glu Glu Arg Thr Pro Thr
310 315 320

Pro Thr Tyr Met Ala Ala Asn Met Glu Tyr Asn Thr Ala Gln Ser Pro
325 330 335

Gln Thr Leu Lys Ser Thr Ser Ser Ser Tyr Leu Asp Asn Gln Gly Val
340 345 350

Val Ser Leu Asp Phe Ser Ala Ser Gly Arg Leu Met Tyr Ser Cys Tyr
355 360 365

Thr Asp Ile Gly Cys Val Val Trp Asp Val Leu Lys Gly Glu Ile Val
370 375 380

Gly Lys Leu Glu Gly His Gly Gly Arg Val Thr Gly Val Arg Ser Ser
390 395 400

Pro Asp Gly Leu Ala Val Cys Thr Gly Ser Trp Asp Ser Thr Met Lys
405 410 415

Ile Trp Ser Pro Gly Tyr Gln
420

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 704 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TRANSCRIPTION FACTOR TIIP, Fig. 45

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

Met Ser Leu Glu Val Ser Asn Ile Asn Gly Gly Asn Gly Thr Gln Leu
5 10 15

Ser His Asp Lys Arg Glu Leu Leu Cys Leu Leu Lys Leu Ile Lys Lys
20 25 30

Tyr Gln Leu Lys Ser Thr Glu Glu Leu Leu Cys Gln Glu Ala Asn Val
35 40 45

Ser Ser Val Glu Leu Ser Glu Ile Ser Glu Ser Asp Val Gln Gln Val
50 55 60

Leu Gly Ala Val Leu Gly Ala Gly Asp Ala Asn Arg Glu Arg Lys His
65 70 75 80

Val Gln Ser Pro Ala Gln Gly His Lys Gln Ser Ala Val Thr Glu Ala

Asn Ala Ala Glu Glu Leu Ala Lys Phe Ile Asp Asp Asp Ser Phe Asp
100 105 110

Ala Gln His Tyr Glu Gln Ala Tyr Lys Glu Leu Arg Thr Phe Val Glu
120 125

Asp Ser Leu Asp Ile Tyr Lys His Glu Leu Ser Met Val Leu Tyr Pro
130 135 140



He Leu val Gin He Tyr Phe Lys He Leu Ala Ser Gly Leu k • 

"° 155 " 

Lys Ala Lys Glu Phe Ile Glu Lys Tyr Lys Cys Asp Leu Asp Gly Tyr

Tyr Ile Glu Gly Leu Phe Asn Leu Leu Leu Leu Ser Lys Pro Glu Glu
180 185 190

Leu Leu Glu Asn Asp Leu Val Val Ala Met Glu Gln Asp Lys Phe Val
195 200 205

Ile Arg Met Ser Arg Asp Ser His Ser Leu Phe Lys Arg His Ile Gln
210 215 220

Asp Arg Arg Gln Glu Val Val Ala Asp Ile Val Ser Lys Tyr Leu His
225 230 235 240

Phe Asp Thr Tyr Glu Gly Met Ala Arg Asn Lys Leu Gln Cys Val Ala
245 250 255

Thr Ala Gly Ser His Leu Gly Glu Ala Lys Arg Gln Asp Asn Lys Met
260 265 270

Arg Val Tyr Tyr Gly Leu Leu Lys Glu Val Asp Phe Gln Thr Leu Thr
275 280 285

Thr Pro Ala Pro Ala Pro Glu Glu Glu Asp Asp Asp Pro Asp Ala Pro
290 295 300

Asp Arg Pro Lys Lys Lys Lys Pro Lys Lys Asp Pro Leu Leu Ser Lys
305 310 315 320

Lys Ser Lys Ser Asp Pro Asn Ala Pro Ser Ile Asp Arg Ile Pro Leu
325 330 335

Pro Glu Leu Lys Asp Ser Asp Lys Leu Leu Lys Leu Lys Ala Leu Arg
340 345 350

Glu Ala Ser Lys Arg Leu Ala Leu Ser Lys Asp Gln Leu Pro Ser Ala
355 360 365

Val Phe Tyr Thr Val Leu Asn Ser His Gln Gly Val Thr Cys Ala Glu
370 375 380



He Ser Asp Asp Ser Thr Met Leu Ala Cys Gly Phe Gly Asp Ser .: : 
385 390 395 40 J 

Val Arg Ile Trp Ser Leu Thr Pro Ala Asn Val Arg Thr Leu Lys Asp
405 410 415

Ala Asp Ser Leu Arg Glu Leu Asp Lys Glu Ser Ala Asp Ile Asn Val
420 425 430

Arg Met Leu Asp Asp Arg Ser Gly Glu Val Thr Arg Ser Leu Met Gly
435 440 445

His Thr Gly Pro Val Tyr Arg Cys Ala Phe Ala Pro Glu Met Asn Leu
450 455 460

Leu Leu Ser Cys Ser Glu Asp Ser Thr Ile Arg Leu Trp Ser Leu Leu
465 470 475 480

Thr Trp Ser Cys Val Val Thr Tyr Arg Gly His Val Tyr Pro Val Trp
485 490 495

Asp Val Arg Phe Ala Pro His Gly Tyr Tyr Phe Val Ser Cys Ser Tyr
500 505 510

Asp Lys Thr Ala Arg Leu Trp Ala Thr Asp Ser Asn Gln Ala Leu Arg
515 520 525

Val Phe Val Gly His Leu Ser Asp Val Asp Cys Val Gln Phe His Pro
530 535

Asn Ser Asn Tyr Val Ala Thr Gly Ser Ser Asp Arg Thr Val Arg Leu
545 550 555 560

Trp Asp Asn Met Thr Gly Gln Ser Val Arg Leu Met Thr Gly His Lys
565 570 575

Gly Ser Val Ser Ser Leu Ala Phe Ser Ala Cys Gly Arg Tyr Leu Ala
580 585 590

Ser Gly Ser Val Asp His Asn Ile Ile Ile Trp Asp Leu Ser Asn Gly
595 600 605

Ser Leu Val Thr Thr Leu Leu Arg His Thr Ser Thr Val Thr Thr Ile
610 615 620

7^2 ^or Arg :..p r^.y I :.ir. l,,, 

630 635 
Asn Leu Thr Leu Trp Asp Phe His Lys Val Thr Glu Asp Tyr Ile Ser
645 650 655

Asn His Ile Thr Val Ser His His Gln Asp Glu Asn Asp Glu Asp Val
660 665 670

Tyr Leu Met Arg Thr Phe Pro Ser Lys Asn Ser Pro Phe Val Ser Leu
675 680 685

His Phe Thr Arg Arg Asn Leu Leu Met Cys Val Gly Leu Phe Lys Ser
695 700



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 713 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1, Fig. 46

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

Met Thr Ala Ser Val Ser Asn Thr Gln Asn Lys Leu Asn Glu Leu Leu
1 5 10 15

Asp Ala Ile Arg Gln Glu Phe Leu Gln Val Ser Gln Glu Ala Asn Thr
20 25 30

Tyr Arg Leu Gln Asn Gln Lys Asp Tyr Asp Phe Lys Met Asn Gln Gln
35 40 45

Leu Ala Glu Met Gln Gln Ile Arg Asn Thr Val Tyr Glu Leu Glu Leu
55 60

Thr ::i2 Ar^ Lyn v-c Lys Azp Al:- Tyr aiu Ala -31- lie Lys His L-u 
40 65 70 75 .. 

Lys Leu Gly Leu Glu Gln Arg Asp His Gln Ile Ala Ser Leu Thr Val
85 90 95

Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Val Gln Gln His Leu
100 105 110

Gln Gln Gln Gln Gln Gln Leu Ala Ala Ala Ser Ala Ser Val Pro Val
115 120 125

Ala Gln Gln Pro Pro Ala Thr Thr Ser Ala Thr Ala Thr Pro Ala Ala
130 135 140

Asn Thr Thr Thr Gly Ser Pro Ser Ala Phe Pro Val Gln Ala Ser Arg
145 150 155 160

Pro Asn Leu Val Gly Ser Gln Leu Pro Thr Thr Thr Leu Pro Val Val
165 170 175

Ser Ser Asn Ala Gln Gln Gln Leu Pro Gln Gln Gln Leu Gln Gln Gln
180 185 190

Gln Leu Gln Gln Gln Gln Pro Pro Pro Gln Val Ser Val Ala Pro Leu
195 200 205

Ser Asn Thr Ala Ile Asn Gly Ser Pro Thr Ser Lys Glu Thr Thr Thr
210 215 220

Leu Pro Ser Val Lys Ala Pro Glu Ser Thr Leu Lys Glu Thr Glu Pro
225 230 235 240

Glu Asn Asn Asn Thr Ser Lys Ile Asn Asp Thr Gly Ser Ala Thr Thr
250 255

Ala Thr Thr Thr Thr Ala Thr Glu Thr Glu Ile Lys Pro Lys Glu Glu
265 270

Asp Ala Thr Pro Ala Ser Leu His Gln Asp His Tyr Leu Val Pro Tyr
280 285

Asn Gln Arg Ala Asn His Ser Lys Pro Ile Pro Pro Phe Leu Leu Asp
295

Leu Asp Ser Gln Ser Val Pro Asp Ala Leu Lys Lys Gln Thr Asn Asp
305 310 315 320

Tyr Tyr He Leu Tyr Asn Pro Ala Leu Pro Arg Glu lie Asp Vi. : . 
325 330 

Leu His Lys Ser Leu Asp His Thr Ser Val Val Cys Cys Val Lys Phe
345 350

Ser Asn Asp Gly Glu Tyr Leu Ala Thr Gly Cys Asn Lys Thr Thr Gln
355 360 365

Val Tyr Arg Val Ser Asp Gly Ser Leu Val Ala Arg Leu Ser Asp Asp
370 375 380

Ser Ala Ala Asn Asn His Arg Asn Ser Ile Thr Glu Asn Asn Thr Thr
385 390 395 400

Thr Ser Thr Asp Asn Asn Thr Met Thr Thr Thr Thr Thr Thr Thr Ile
405 410 415

Thr Thr Thr Ala Met Thr Ser Ala Ala Glu Leu Ala Lys Asp Val Glu
420 425 430

Asn Leu Asn Thr Ser Ser Ser Pro Ser Ser Asp Leu Tyr Ile Arg Ser
435 440 445

Val Cys Phe Ser Pro Asp Gly Lys Phe Leu Ala Thr Gly Ala Glu Asp
450 455 460

Arg Leu Ile Arg Ile Trp Asp Ile Glu Asn Arg Lys Ile Val Met Ile
465 470 475 480

Leu Gln Gly His Glu Gln Asp Ile Tyr Ser Leu Asp Tyr Phe Pro Ser
485 490 495

Gly Asp Lys Leu Val Ser Gly Ser Gly Asp Arg Thr Val Arg Ile Trp
500 505 510

Asp Leu Arg Thr Gly Gln Cys Ser Leu Thr Leu Ser Ile Glu Asp Gly
515 520 525

Val Thr Thr Val Ala Val Ser Pro Gly Asp Gly Lys Tyr Ile Ala Ala
530 535 540

Gly Ser Leu Asp Arg Ala Val Arg Val Trp Asp Ser Glu Thr Gly Phe
545 550 555 560



Leu Val Glu Arg I*eu A-p Ter Glu Asn Glu Ser Gly Thr :.\y :ii.-5 Lys 
565 570 575 

Asp Ser Val Tyr Ser Val Val Phe Thr Arg Asp Gly Gln Ser Val Val
580 585 590

Ser Gly Ser Leu Asp Arg Ser Val Lys Leu Trp Asn Leu Gln Asn Ala
595 600 605

Asn Asn Lys Ser Asp Ser Lys Thr Pro Asn Ser Gly Thr Cys Glu Val
610 615 620

Thr Tyr Ile Gly His Lys Asp Phe Val Leu Ser Val Ala Thr Thr Gln
625 630 635 640

Asn Asp Glu Tyr Ile Leu Ser Gly Ser Lys Asp Arg Gly Val Leu Phe
645 650 655

Trp Asp Lys Lys Ser Gly Asn Pro Leu Leu Met Leu Gln Gly His Arg
660 665 670

Asn Ser Val Ile Ser Val Ala Val Ala Asn Gly Ser Ser Leu Gly Pro
675 680 685

Glu Tyr Asn Val Phe Ala Thr Gly Ser Gly Asp Cys Lys Ala Arg Ile
690 695 700

Trp Lys Tyr Lys Lys Ile Ala Pro Asn
705 710

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG, Fig. 47

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:

Met Ser Gln Lys Gln Ser Thr Asn Gln Asn Gln Asn Gly Thr His
1 5 10 15

Pro Gln Pro Val Lys Asn Gln Arg Thr Asn Asn Ala Ala Gly Ala Asn
20 25 30

Ser Gly Gln Gln Pro Gln Gln Gln Ser Gln Gly Gln Ser Gln Gln Gln
35 40 45

Gly Arg Ser Asn Gly Pro Phe Ser Ala Ser Asp Leu Asn Arg Ile Val
50 55 60

Leu Glu Tyr Leu Asn Lys Lys Gly Tyr His Arg Thr Glu Ala Met Leu
65 70 75 80

Arg Ala Glu Ser Gly Arg Thr Leu Thr Pro Gln Asn Lys Gln Ser Pro
85 90 95

Ala Asn Thr Lys Thr Gly Lys Phe Pro Glu Gln Ser Ser Ile Pro Pro
100 105 110

Asn Pro Gly Lys Thr Ala Lys Pro Ile Ser Asn Pro Thr Asn Leu Ser
115 120 125

Ser Lys Arg Asp Ala Glu Gly Gly Ile Val Ser Ser Gly Arg Leu Glu
130 135 140

Gly Leu Asn Ala Pro Glu Asn Tyr Ile Arg Ala Tyr Ser Met Leu Lys
150 155 160

Asn Trp Val Asp Ser Ser Leu Glu Ile Tyr Lys Pro Glu Leu Ser Tyr
165 170 175

Ile Met Tyr Pro Ile Phe Ile Tyr Leu Phe Leu Asn Leu Val Ala Lys
180 185 190

Asn Pro Val Tyr Ala Arg Arg Phe Phe Asp Arg Phe Ser Pro Asp Phe
195 200 205

Lys Asp Phe His Gly Ser Glu Ile Asn Arg Leu Phe Ser Val Asn Ser
210 215 220

Ile Asp His Ile Lys Glu Asn Glu Val Ala Ser Ala Phe Gln Ser His
225 230 235 240



40 



Lys Tyr Arg He Thr Met Ser Lys Thr Thr Leu Asn Leu Leu L-.- 
245 250 255 



45 



Phe Leu Asn Glu Asn Glu Ser Ile Gly Gly Ser Leu Ile Ile Ser Val
260 265 270

Ile Asn Gln His Leu Asp Pro Asn Ile Val Glu Ser Val Thr Ala Arg
275 280 285

Glu Lys Leu Ala Asp Gly Ile Lys Val Leu Ser Asp Ser Glu Asn Gly
290 295 300

Asn Gly Lys Gln Asn Leu Glu Met Asn Ser Val Pro Val Lys Leu Gly
305 310 315 320

Pro Phe Pro Lys Asp Glu Glu Phe Val Lys Glu Ile Glu Thr Glu Leu
325 330 335

Lys Ile Lys Asp Asp Gln Glu Lys Gln Leu Asn Gln Gln Thr Ala Gly
340 345 350

Asp Asn Tyr Ser Gly Ala Asn Asn Arg Thr Leu Leu Gln Glu Tyr Lys
355 360 365

Ala Met Asn Asn Glu Lys Phe Lys Asp Asn Thr Gly Asp Asp Asp Lys
370 375 380

Asp Lys Ile Lys Asp Lys Ile Ala Lys Asp Glu Glu Lys Lys Glu Ser
385 390 395 400

Glu Leu Lys Val Asp Gly Glu Lys Lys Asp Ser Asn Leu Ser Ser Pro
405 410 415

Ala Arg Asp Ile Leu Pro Leu Pro Pro Lys Thr Ala Leu Asp Leu Lys
420 425 430

Leu Glu Ile Gln Lys Val Lys Glu Ser Arg Asp Ala Ile Lys Leu Asp
435 440 445

Asn Leu Gln Leu Ala Leu Pro Ser Val Cys Met Tyr Thr Phe Gln Asn
450 455 460

Thr Asn Lys Asp Met Ser Cys Leu Asp Phe Ser Asp Asp Cys Arg Ile
465 470 475 480

^--i ^'-y ^ ■ - - '-"'P -5- T/r Z". \ ".y-j : ' . s«r Leu J.sp 
^0 485 490 495 

Gly Ser Ser Leu Asn Asn Pro Asn Ile Ala Leu Asn Asn Asn Asp Lys
500 505 510

Asp Glu Asp Pro Thr Cys Lys Thr Leu Val Gly His Ser Gly Thr Val
515 520 525

Tyr Ser Thr Ser Phe Ser Pro Asp Asn Lys Tyr Leu Leu Ser Gly Ser
530 535

Glu Asp Lys Thr Val Arg Leu Trp Ser Met Asp Thr His Thr Ala Leu
545 550 555 560

Val Ser Tyr Lys Gly His Asn His Pro Val Trp Asp Val Ser Phe Ser
565 570

Pro Leu Gly His Tyr Phe Ala Thr Ala Ser His Asp Gln Thr Ala Arg
580 585 590

Leu Trp Ser Cys Asp His Ile Tyr Pro Leu Arg Ile Phe Ala Gly His
595 600 605

Leu Asn Asp Val Asp Cys Val Ser Phe His Pro Asn Gly Cys Tyr Val
610 615 620

Phe Thr Gly Ser Ser Asp Lys Thr Cys Arg Met Trp Asp Val Ser Thr
625 630 635 640

Gly Asp Ser Val Arg Leu Phe Leu Gly His Thr Ala Pro Val Ile Ser
645 650 655

Ile Ala Val Cys Pro Asp Gly Arg Trp Leu Ser Thr Gly Ser Glu Asp
660 665 670

Gly Ile Ile Asn Val Trp Asp Ile Gly Thr Gly Lys Arg Leu Lys Gln
675 680 685

Met Arg Gly His Gly Lys Asn Ala Ile Tyr Ser Leu Ser Tyr Ser Lys
690 695 700

Glu Gly Asn Val Leu Ile Ser Gly Gly Ala Asp His Thr Val Arg Val
715 720

Trp Asp Leu Lys Lys Ala Thr Thr Glu Pro Ser Ala Glu Pro Asp Glu
730 735

Pro Phe Ile Gly Tyr Leu Gly Asp Val Thr Ala Ser Ile Asn Gln Asp
740 745 750

Ile Lys Glu Tyr Gly Arg Arg Arg Thr Val Ile Pro Thr Ser Asp Leu
755 760 765

Val Ala Ser Phe Tyr Thr Lys Lys Thr Pro Val Phe Lys Val Lys Phe
770 775 780

Ser Arg Ser Asn Leu Ala Leu Ala Gly Gly Ala Phe Arg Pro
785 790 795

(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 439 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: YCU7, Fig. 48



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Met Val Arg Arg Phe Arg Gly Lys Glu Leu Ala Ala Thr Thr Phe Asn
10 15

Gly His Arg Asp Tyr Val Met Gly Ala Phe Phe Ser His Asp Gln Glu
20 25 30

Lys Ile Tyr Thr Val Ser Lys Asp Gly Ala Val Phe Val Trp Glu Phe
35 40 45

Thr Lys Arg Pro Ser Asp Asp Asp Asp Asn Glu Ser Glu Asp Asp Asp
50 55

Lys Gin Glu Glu Val Asp He Ser Lys T^n: Ser Trp Arg He Thr Lys 

55 7"> 

75 80 



Lys His Phe Phe Tyr Ala Asn Gin Ala Lys Val Lys Cys Val Th.- 
85 90 I's' 

His Pro Ala Thr Arg Leu Leu Ala Val Gly Phe Thr Ser Gly Glu Phe
105

Arg Leu Tyr Asp Leu Pro Asp Phe Thr Leu Ile Gln Gln Leu Ser Met
115

Gly Gln Asn Pro Val Asn Thr Val Ser Val Asn Gln Thr Gly Glu Trp
130 135

Leu Ala Phe Gly Ser Ser Lys Leu Gly Gln Leu Leu Val Tyr Glu Trp
145 150 155 160

Gln Ser Glu Ser Tyr Ile Leu Lys Gln Gln Gly His Phe Asp Ser Thr
165 170 175

Asn Ser Leu Ala Tyr Ser Pro Asp Gly Ser Arg Val Val Thr Ala Ser
180 185 190

Glu Asp Gly Lys Ile Lys Val Trp Asp Ile Thr Ser Gly Phe Cys Leu
195 200 205

Ala Thr Phe Glu Glu His Thr Ser Ser Val Thr Ala Val Gln Phe Ala
210 215 220

Lys Arg Gly Gln Val Met Phe Ser Ser Ser Leu Asp Gly Thr Val Arg
225 230 235 240

Ala Trp Asp Leu Ile Arg Tyr Arg Asn Phe Arg Thr Phe Thr Gly Thr
245 250 255

Glu Arg Ile Gln Phe Asn Cys Leu Ala Val Asp Pro Ser Gly Glu Val
260 265 270

Val Cys Ala Gly Ser Leu Asp Asn Phe Asp Ile His Val Trp Ser Val
275 280 285

Gln Thr Gly Gln Leu Leu Asp Ala Leu Ser Gly His Glu Gly Pro Val
290 295 300

Ser Cys Leu Ser Phe Ser Gln Glu Asn Ser Val Leu Ala Ser Ala Ser
310 315 320



-rp Ajp Lys Thr 11;* ;.r7 '.^ ?ro :or V,e rhi2 'Jlv Ar-: '-r Gin Gin 
325 330 3I5 

Val Glu Pro Ile Glu Val Tyr Ser Asp Val Leu Ala Leu Ser Met Arg
340 345 350

Pro Asp Gly Lys Glu Val Ala Val Ser Thr Leu Lys Gly Gln Ile Ser
355 360 365
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. He Phe Asn He Glu Asp Ala Lyr. Gin Val Gly Asn lie Asp Cys Arg 
370 373 380 

Lys Asp Ile Ile Ser Gly Arg Phe Asn Gln Asp Arg Phe Thr Ala Lys
385 390 395 400

Ile Leu Asn Asp Pro Asn Phe Leu Leu Gln Tyr Ile Thr Val Leu Met
405 410 415

Val Trp Leu Leu Trp Leu Val Val Ile Ile Thr Pro Phe Val Tyr Met
420 425 430

Met Phe Gln Met Lys Ser Cys
435

(2) INFORMATION FOR SEQ ID NO: 66: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN, Fig. 49

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Met Ser Thr Leu Ile Pro Pro Pro Ser Lys Lys Gln Lys Lys Glu Ala
1 5 10 15

Clr. I,-u ?io Arg Glu Val Ala He He Pro Lys Asp Leu Pro Asn Val 

40 

Ser He Lys Phe Gin Ala Leu Asp Thr Gly Asp Asn Val Gly Gly , 
35 40 45 

Leu Arg Val Pro Gly Ala Ile Ser Glu Lys Gln Leu Glu Glu Leu Leu
50 55 60

Asn Gln Leu Asn Gly Thr Ser Asp Asp Pro Val Pro Tyr Thr Phe Ser
65 70 75 80

Cys Thr Ile Gln Gly Lys Lys Ala Ser Asp Pro Val Lys Thr Ile Asp
85 90 95

Ile Thr Asp Asn Leu Tyr Ser Ser Leu Ile Lys Pro Gly Tyr Asn Ser
100 105 110

Thr Glu Asp Gln Ile Thr Leu Leu Tyr Thr Pro Arg Ala Val Phe Lys
115 120 125

Val Lys Pro Val Thr Arg Ser Ser Ser Ala Ile Ala Gly His Gly Ser
130 135 140

Thr Ile Leu Cys Ser Ala Phe Ala Pro His Thr Ser Ser Arg Met Val
145 150 155 160

Thr Gly Ala Gly Asp Asn Thr Ala Arg Ile Trp Asp Cys Asp Thr Gln
165 170 175

Thr Pro Met His Thr Leu Lys Gly His Tyr Asn Trp Val Leu Cys Val
180 185 190

Ser Trp Ser Pro Asp Gly Glu Val Ile Ala Thr Gly Ser Met Asp Asn
195 200 205

Thr Ile Arg Leu Trp Asp Pro Lys Ser Gly Gln Cys Leu Gly Asp Ala
210 215 220

Leu Arg Gly His Ser Lys Trp Ile Thr Ser Leu Ser Trp Glu Pro Ile
225 230 235 240

His Leu Val Lys Pro Gly Ser Lys Pro Arg Leu Ala Ser Ser Ser Lys
245 250 255

Asp Gly Thr Ile Lys Ile Trp Asp Thr Val Ser Arg Val Cys Gln Tyr
260 265 270



40 



Thr Met Ser Gly His Thr Asn Ser Val Ser Cys Val Lys Trp Gly .-y 
275 280 285 



45 



Gln Gly Leu Leu Tyr Ser Gly Ser His Asp Arg Thr Val Arg Val Trp
290 295 300

Asp Ile Asn Ser Gln Gly Arg Cys Ile Asn Ile Leu Lys Ser His Ala
305 310 315 320

His Trp Val Asn His Leu Ser Leu Ser Thr Asp Tyr Ala Leu Arg Ile
325 330 335

Gly Ala Phe Asp His Thr Gly Lys Lys Pro Ser Thr Pro Glu Glu Ala
340 345 350

Gln Lys Lys Ala Leu Glu Asn Tyr Glu Lys Ile Cys Lys Lys Asn Gly
355 360 365

Asn Ser Glu Glu Met Met Val Thr Ala Ser Asp Asp Tyr Thr Met Phe
370 375 380

Leu Trp Asn Pro Leu Lys Ser Thr Lys Pro Ile Ala Arg Met Thr Gly
385 390 395 400

His Gln Lys Leu Val Asn His Val Ala Phe Ser Pro Asp Gly Arg Tyr
405 410 415

Ile Val Ser Ala Ser Phe Asp Asn Ser Ile Lys Leu Trp Asp Gly Arg
420 425 430

Asp Gly Lys Phe Ile Ser Thr Phe Arg Gly His Ile Ala Ser Val Tyr
435 440 445

Gln Val Ala Trp Ser Ser Asp Cys Arg Leu Leu Val Ser Cys Ser Lys
450 455 460

Asp Thr Thr Leu Lys Val Trp Asp Val Arg Thr Arg Lys Leu Ser Val
465 470 475 480

Asp Leu Pro Gly Ile Lys Thr Lys Leu Tyr Val Asp Trp Ser Val Asp
485 490 495

Gly Lys Arg Val Cys Ser Gly Gly Lys Asp Lys Met Val Arg Leu Trp
500 505 510



(2) INFORMATION FOR SEQ ID NO: 67: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 852 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: YKL525, Fig. 50



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

Met Phe Lys Ser Lys Thr Ser Thr Leu Ser Tyr Asp Glu Thr Pro Asn
10 15

Ser Asn Glu Gly Asp Arg Asn Ala Thr Pro Val Asn Pro Lys Glu Lys
20 25 30

Ser Gln Thr Lys His Leu Asn Ile Pro Gly Asp Arg Ser Arg His Ser
35 40 45

Ser Ile Ala Asp Ser Lys Arg Ser Ser Ser Arg Tyr Asp Gly Gly Tyr
55 60

Ser Ala Asp Ile Ile Pro Ala Gln Leu Arg Phe Ile Asp Asn Ile Asp
65 70 75 80

Tyr Gly Thr Arg Leu Arg Lys Thr Leu His Arg Asn Ser Val Val Ser
85 90 95

Asn Gly Tyr Asn Lys Leu Ser Glu Asn Asp Arg Trp Tyr Phe Asp Leu
100 105 110

Phe Asp Arg Lys Tyr Phe Glu Asn Tyr Leu Glu Glu Pro Thr Tyr Ile
115 120 125



~'/3 -l<i r'.li Lys Lvz Iv; -1.. .— _ r~- 

'---n ?.-» Asn Arg Met PI- 

140 



40 130 135 



Leu Ala Gln Glu Leu Lys Ile Pro Asp Val Tyr Lys Ser Thr Thr Tyr
145 150 155 160

Gln Gly Glu Pro Ala Val Ala Asn Ser Glu Leu Phe Lys Asn Ser Ile
165 170 175
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Cys Cys Cys Thr Ph3 o^ir His Asp Gly Lys Tyr Met Val He Gly Cys 
180 las „o 

Lys Asp Gly Ser Leu His Leu Trp Lys Val Ile Asn Ser Pro Val Lys
195 200 205

Arg Ser Glu Met Gly Arg Ser Glu Lys Ser Val Ser Ala Ser Arg Ala
210 215 220

Asn Ser Leu Lys Ile Gln Arg His Leu Ala Ser Ile Ser Ser His Asn
225 230 235 240

Gly Ser Ile Ser Ser Asn Asp Leu Lys Pro Ser Asp Gln Phe Glu Gly
245 250 255

Pro Ser Lys Gln Leu His Leu Tyr Ala Pro Val Phe Tyr Ser Asp Val
260 265 270

Phe Arg Val Phe Met Glu His Ala Leu Asp Ile Leu Asp Ala Asn Trp
275 280 285

Ser Lys Asn Gly Phe Leu Ile Thr Ala Ser Met Asp Lys Thr Ala Lys
290 295 300

Leu Trp His Pro Glu Arg Lys Tyr Ser Leu Lys Thr Phe Val His Pro
305 310 315 320

Asp Phe Val Thr Ser Ala Ile Phe Phe Pro Asn Asp Asp Arg Phe Ile
325 330

Ile Thr Gly Cys Leu Asp His Arg Cys Arg Leu Trp Ser Ile Leu Asp
340 350

Asn Glu Val Ser Tyr Ala Phe Asp Cys Lys Asp Leu Ile Thr Ser Leu
355 360 365

Thr Leu Ser Pro Pro Gly Gly Glu Tyr Thr Ile Ile Gly Thr Phe Asn
370 375 380


Gly Tyr He Tyr Val Leu Leu Thr His Gly Leu Lys Phe Val Se' - • 
385 390 395 ' 

Phe His Val Ser Asp Lys Ser Thr Gln Gly Thr Thr Lys Asn Ser Phe
405 410 415

His Pro Ser Ser Glu Tyr Gly Lys Val Gln His Gly Pro Arg Ile Thr
420 425 430

Gly Leu Gln Cys Phe Phe Ser Lys Val Asp Lys Asn Leu Arg Leu Ile
435 440 445

Val Thr Thr Asn Asp Ser Lys Ile Gln Ile Phe Asp Leu Asn Glu Lys
450 455 460

Lys Pro Leu Glu Leu Phe Lys Gly Phe Gln Ser Gly Ser Ser Arg His
475 480

Arg Gly Gln Phe Leu Met Met Lys Asn Glu Pro Val Val Phe Thr Gly
485 490 495

Ser Asp Asp His Trp Phe Tyr Thr Trp Lys Met Gln Ser Phe Asn Leu
500 505 510

Ser Ala Glu Met Asn Cys Thr Ala Pro His Arg Lys Lys Arg Leu Ser
515 520

Gly Ser Met Ser Leu Lys Gly Leu Leu Arg Ile Val Ser Asn Lys Ser
530 535 540

Thr Asn Asp Glu Cys Leu Thr Glu Thr Ser Asn Gln Ser Ser Ser His
545 550 555 560

Thr Phe Thr Asn Ser Ser Lys Asn Val Leu Gln Thr Gln Thr Val Gly
570

Ser Gln Ala Ile Lys Asn Asn His Tyr Ile Ser Phe His Ala His Asn
580 585 590

Ser Pro Val Thr Cys Ala Ser Ile Ala Pro Asp Val Ala Ile Lys Asn
600 605

Leu Ser Leu Ser Asn Asp Leu Ile Phe Glu Leu Thr Ser Gln Tyr Phe



€10 

*-/3 Clu Mac 01 V T'*-- 
40 625 



615 



620 



■^r:- -:v Ciu S- J.yn - I 7-r Cv= Aso A^p 

"0 



Lys Pro Asn His Pro Val Thr Glu Thr Gly Gly Phe Ser Ser Asn Leu
650 655

Ser Asn Val Val Asn Asn Val Gly Thr Ile Leu Ile Thr Thr Asp Ser
665 670
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Gin Gly Leu lie Arg Val rhe Pjrj Thr Asp lie Leu Pro Glu lis Arg 
fi'S .J30 683 

Lys Lys Ile Ile Glu Lys Phe His Glu Tyr Asn Leu Phe His Leu Glu
690 695

Ala Ala Gly Lys Ile Asn Asn His Asn Asn Asp Ser Ile Leu Glu Asn
710 715 720

Arg Met Asp Glu Arg Ser Ser Thr Glu Asp Asn Glu Phe Ser Thr Thr
725 730 735

Pro Pro Ser Asn Thr His Asn Ser Arg Pro Ser His Asp Phe Cys Glu
740 745 750

Leu His Pro Asn Asn Ser Pro Val Ile Ser Gly Met Pro Ser Arg Ala
755 760 765

Ser Ala Ile Phe Lys Asn Ser Ile Phe Asn Lys Ser Asn Gly Ser Phe
770 775 780

Ile Ser Leu Lys Ser Arg Ser Glu Ser Thr Ser Ser Thr Val Phe Gly
790 795 800

Pro His Asp Ile Pro Arg Val Ser Thr Thr Tyr Pro Lys Leu Lys Cys
805 810 815

Asp Val Cys Asn Gly Ser Asn Phe Glu Cys Ala Ser Lys Asn Pro Ile
820 825 830

Ala Gly Gly Asp Ser Gly Phe Thr Cys Ala Asp Cys Gly Thr Ile Leu
835 840 845

Asn Asn Phe Arg
850

(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 798 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: yrb 1410 yeast, Fig. 51

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Met Ser Gln Lys Gln Ser Thr Asn Gln Asn Gln Asn Gly Thr His Gln
5 10 15

Pro Gln Pro Val Lys Asn Gln Arg Thr Asn Asn Ala Ala Gly Ala Asn
25 30

Ser Gly Gln Gln Pro Gln Gln Gln Ser Gln Gly Gln Ser Gln Gln Gln
35 40

Gly Arg Ser Asn Gly Pro Phe Ser Ala Ser Asp Leu Asn Arg Ile Val
60

Leu Glu Tyr Leu Asn Lys Lys Gly Tyr His Arg Thr Glu Ala Met Leu
70 80

Arg Ala Glu Ser Gly Arg Thr Leu Thr Pro Gln Asn Lys Gln Ser Pro

Ala Asn Thr Lys Thr Gly Lys Phe Pro Glu Gln Ser Ser Ile Pro Pro
100 105 110

Asn Pro Gly Lys Thr Ala Lys Pro Ile Ser Asn Pro Thr Asn Leu Ser
120

Ser Lys Arg Asp Ala Glu Gly Gly Ile Val Ser Ser Gly Arg Leu Glu
130 135 140

Gly Leu Asn Ala Pro Glu Asn Tyr Ile Arg Ala Tyr Ser Met Leu Lys

Asn rrp Va: Xsp 3 -r 






150 

160 



.- i —a Tyr :./3 Pro Glj Leu £^ ^ 

"5 

Ile Met Tyr Pro Ile Phe Ile Tyr Leu Phe Leu Asn Leu Val Ala Lys
180 185 190

Asn Pro Val Tyr Ala Arg Arg Phe Phe Asp Arg Phe Ser Pro Asp Phe
195 200 205

Lys Asp Phe His Gly Ser Glu Ile Asn Arg Leu Phe Ser Val Asn Ser
210 215 220

Ile Asp His Ile Lys Glu Asn Glu Val Ala Ser Ala Phe Gln Ser His
225 230 235 240

Lys Tyr Arg Ile Thr Met Ser Lys Thr Thr Leu Asn Leu Leu Leu Tyr
245 250 255

Phe Leu Asn Glu Asn Glu Ser Ile Gly Gly Ser Leu Ile Ile Ser Val
260 265 270

Ile Asn Gln His Leu Asp Pro Asn Ile Val Glu Ser Val Thr Ala Arg
275 280 285

Glu Lys Leu Ala Asp Gly Ile Lys Val Leu Ser Asp Ser Glu Asn Gly
290 295 300

Asn Gly Lys Gln Asn Leu Glu Met Asn Ser Val Pro Val Lys Leu Gly
305 310 315 320

Pro Phe Pro Lys Asp Glu Glu Phe Val Lys Glu Ile Glu Thr Glu Leu
325 330 335

Lys Ile Lys Asp Asp Gln Glu Lys Gln Leu Asn Gln Gln Thr Ala Gly
340 345 350

Asp Asn Tyr Ser Gly Ala Asn Asn Arg Thr Leu Leu Gln Glu Tyr Lys
355 360 365

Ala Met Asn Asn Glu Lys Phe Lys Asp Asn Thr Gly Asp Asp Asp Lys
370 375 380

Asp Lys Ile Lys Asp Lys Ile Ala Lys Asp Glu Glu Lys Lys Glu Ser
385 390 395 400

Glu Leu Lys Val Asp Gly Glu Lys Lys Asp Ser Asn Leu Ser Ser Pro
405 410 415

40 Ala Arg Asp He LeU Pro Leu Pro Pro Lys Thr Ala Leu Asp Leu , ^ 

420 425 430 

Leu Glu Ile Gln Lys Val Lys Glu Ser Arg Asp Ala Ile Lys Leu Asp
435 440 445

Asn Leu Gln Leu Ala Leu Pro Ser Val Cys Met Tyr Thr Phe Gln Asn
450 460

Thr Asn Lys Asp Met Ser Cys Leu Asp Phe Ser Asp Asp Cys Arg Ile
475

Ala Ala Ala Gly Phe Gln Asp Ser Tyr Ile Lys Ile Trp Ser Leu Asp
485 490

Gly Ser Ser Leu Asn Asn Pro Asn Ile Ala Leu Asn Asn Asn Asp Lys
500 505

Asp Glu Asp Pro Thr Cys Lys Thr Leu Val Gly His Ser Gly Thr Val
520

Tyr Ser Thr Ser Phe Ser Pro Asp Asn Lys Tyr Leu Leu Ser Gly Ser
530 535 540

Glu Asp Lys Thr Val Arg Leu Trp Ser Met Asp Thr His Thr Ala Leu
545 550 555 560

Val Ser Tyr Lys Gly His Asn His Pro Val Trp Asp Val Ser Phe Ser
565 570 575

Pro Leu Gly His Tyr Phe Ala Thr Ala Ser His Asp Gln Thr Ala Arg
580 585 590

Leu Trp Ser Cys Asp His Ile Tyr Pro Leu Arg Ile Phe Ala Gly His
600 605

Leu Asn Asp Val Asp Cys Val Ser Phe His Pro Asn Gly Cys Tyr Val
610 615 620

Phe Thr Gly Ser Ser Asp Lys Thr Cys Arg Met Trp Asp Val Ser Thr
625 630 635 640

Gly Asp Ser Val Arg Leu Phe Leu Gly His Thr Ala Pro Val Ile Ser
655

no .la VM cy. Pro Arp Cly ^r ; r.-.s r.y Ser Glu Asz: 

"° 665 670 

Gly Ile Ile Asn Val Trp Asp Ile Gly Thr Gly Lys Arg Leu Lys Gln
675 680 685

Met Arg Gly His Gly Lys Asn Ala Ile Tyr Ser Leu Ser Tyr Ser Lys
690 695 700

Glu Gly Asn Val Leu Ile Ser Gly Gly Ala Asp His Thr Val Arg Val

Trp Asp Leu Lys Lys Ala Thr Thr Glu Pro Ser Ala Glu Pro Asp Glu

Pro Phe Ile Gly Tyr Leu Gly Asp Val Thr Ala Ser Ile Asn Gln Asp
745

Ile Lys Glu Tyr Gly Arg Arg Arg Thr Val Ile Pro Thr Ser Asp Leu
755 760 765

Val Ala Ser Phe Tyr Thr Lys Lys Thr Pro Val Phe Lys Val Lys Phe
775

Ser Arg Ser Asn Leu Ala Leu Ala Gly Gly Ala Phe Arg Pro
785 790



(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 protein rI, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

T:ly :-:i3 Asn 'Jlv Trp Val T^r .-rn - ^ o ^ * , -u ^■ 
40 1 ^^'^ Phe Fro 

^ 10 , 



Asp Met Ile Leu Ser Ala Ser Arg Asp Lys Thr Ile Ile Met Trp Lys
20 25 30

(2) INFORMATION FOR SEQ ID NO: 70: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 protein rII, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Gly His Ser His Phe Val Ser Asp Val Val Ile Ser Ser Asp Gly Gln

Phe Ala Leu Ser Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 protein rIII, Fig. 1C



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Gly His Thr Lys Asp Val Leu Ser Val Ala Phe Ser Ser Asp Asn Arg
1 5 10 15

Gln Ile Val Ser Gly Ser Arg Asp Lys Thr Ile Lys Leu Trp Asn
30



(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 protein rIV, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:

Ser His Ser Glu Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser Ser
10 15

Asn Pro Ile Ile Val Ser Cys Gly Trp Asp Lys Leu Val Lys Val Trp
20 25 30

Asn

(2) INFORMATION FOR SEQ ID NO: 73: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 protein rV, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:

Gly His Thr Gly 7yr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser 

Leu Cys Ala Ser Gly Gly Lys Asp Gly Gin Ala Met Leu Trp Asp 
" 25 30 

(2) INFORMATION FOR SEQ ID NO: 74:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 protein rVI, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:

Leu Asp Gly Gly Asp Ile Ile Asn Ala Leu Cys Phe Ser Pro Asn Arg
1 5 10 15

Tyr Trp Leu Cys Ala Ala Thr Gly Pro Ser Ile Lys Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids
(B) TYPE: amino acid

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RACK1 protein rVII, Fig. 1C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:

Ser Lys Ala Glu Pro Pro Gln Cys Thr Ser Leu Ala Trp Ser Ala Asp
1 5 10 15

Gly Gln Thr Leu Phe Ala Gly Tyr Thr Asp Asn Leu Val Arg Val Trp
20 25 30

Gln



(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Human 55 kDa protein rI, Fig. 11

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:

Gly His Thr Asp Ala Val Leu Asp Leu Ser Trp Asn Lys Leu Ile Arg
1 5 10 15

Asn Val Leu Ala Ser Ala Ser Ala Asp Asn Thr Val Ile Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Human 55 kDa protein rII, Fig. 11

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

Ala His Asn Asp Glu Ile Ser Gly Leu Asp Leu Ser Ser Gln Ile Lys
1 5 10 15

Gly Cys Leu Val Thr Ala Ser Ala Asp Lys Tyr Val Lys Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Human 55 kDa protein rIII, Fig. 11

Val His Ser Arg Asp Met Lys Met Gly Val Leu Phe Cys Ser Ser . .
1 5 10 15

Cys Pro Asp Leu Pro Phe Ile Tyr Ala Phe Gly Gly Gln Lys Glu Gly
20 25 30

Leu Arg Val Trp Asp
35

(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: AAC-RICH protein rI, Fig. 12

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:

Gly Asn Lys Lys Lys Ser Thr Ser Val Ala Trp Asn Ala Asn Gly Thr
1 5 10 15

Lys Ile Ala Ser Ser Gly Ser Asp Gly Ile Val Arg Val Trp Asn
20 25 30

(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: AAC-RICH protein rII, Fig. 12

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

Gly His Asp Gly Ser Ile Glu Lys Ile Ser Trp Ser Pro Lys Asn Asn
1 5 10 15

Asp Leu Leu Ala Ser Ala Gly Thr Asp Lys Val Ile Lys Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: AAC-RICH protein rIII, Fig. 12

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:

Asp His Leu Ala Leu Ile Asp Leu Pro Thr Ile Lys Thr Leu Lys Ile
1 5 10 15

Tyr Lys Phe Asn Gly Glu Glu Leu Asn Gln Val Gly Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 82:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: AAC-RICH protein rIV, Fig. 12

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

Gly His Thr Ala Ser Ile Tyr Cys Met Glu Phe Asp Pro Thr Gly Lys
1 5 10 15

Tyr Leu Ala Ala Gly Ser Ala Asp Ser Ile Val Ser Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: BETA TRCP rI, Fig. 13

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:

Ile His Cys Arg Ser Glu Thr Ser Lys Gly Val Tyr Cys Leu Gln Tyr
1 5 10 15

Asp Asp Gln Lys Ile Val Ser Gly Leu Arg Asp Asn Thr Ile Lys Ile
20 25 30

Trp Asp



(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: BETA TRCP rII, Fig. 13

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

Gly His Thr Gly Ser Val Leu Cys Leu Gln Tyr Asp Glu Arg Val Ile
1 5 10 15

Ile Thr Gly Ser Asp Ser Thr Val Arg Val Trp Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: BETA TRCP rIII, Fig. 13

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:

Ile His His Cys Glu Ala Val Leu His Leu Arg Phe Asn Asn Gly Met

Met Val Thr Cys Ser Lys Asp Arg Ser Ile Ala Val Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: BETA TRCP rIV, Fig. 13

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:

Gly His Arg Ala Ala Val Asn Val Val Asp Phe Asp Asp Lys Tyr Ile
1 5 10 15

Val Ser Ala Ser Gly Asp Arg Thr Ile Lys Val Trp Asn
20 25

(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: BETA TRCP rV, Fig. 13

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

Gly His Lys Arg Gly Ile Ala Cys Leu Gln Tyr Arg Asp Arg Leu Val
1 5 10 15

Val Ser Gly Ser Ser Asp Asn Thr Ile Arg Leu Trp Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 88:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: BETA TRCP rVI, Fig. 13

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:

Gly His Glu Glu Leu Val Arg Cys Ile Arg Phe Asp Asn Lys Arg Ile
1 5 10 15

Val Ser Gly Ala Tyr Asp Gly Lys Ile Lys Val Trp Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 89:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: BETA TRCP rVII, Fig. 13

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

Glu His Ser Gly Arg Val Phe Arg Leu Gln Phe Asp Glu Phe Gln Ile
1 5 10 15

Val Ser Ser Ser His Asp Asp Thr Ile Leu Ile Trp Asp
20 25



(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: beta-prime-cop rI, Fig. 14

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:

Ala His Ser Asp Tyr Ile Arg Cys Ile Ala Val His Pro Thr Gln Pro

Phe Ile Leu Thr Ser Ser Asp Asp Met Leu Ile Lys Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 91:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: beta-prime-cop rII, Fig. 14

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:

Gly His Thr His Tyr Val Met Gln Ile Val Ile Asn Pro Lys Asp Asn

Asn Gln Phe Ala Ser Ala Ser Leu Asp Arg Thr Ile Lys Val Trp Gln
20 25 30

(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: beta-prime-cop rIII, Fig. 14

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:

Gly His Glu Lys Gly Val Asn Cys Ile Asp Tyr Tyr Ser Gly Gly Asp
1 5 10 15

Lys Pro Tyr Leu Ile Ser Gly Ala Asp Asp Arg Leu Val Lys Ile Trp
20 25 30

Asp

(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO
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(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CDC4 / CDC20 protein rII, Fig. 15

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:

Gly His Asp Gly Gly Val Trp Ala Leu Lys Tyr Ala His Gly Gly Ile
1 5 10 15

Leu Val Ser Gly Ser Thr Asp Arg Thr Val Arg Val Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 96:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CDC4 / CDC20 protein rIII, Fig. 15

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

Gly His Asn Ser Thr Val Arg Cys Leu Asp Ile Val Glu Tyr Lys Asn
1 5 10 15



11= Lys 7yr ZI2 Val ^ly Arg Aso .\y.i r..r Lci Vc 

" 20 25 30 

Lys 



1 Trr 



(2) INFORMATION FOR SEQ ID NO: 97:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CDC4 / CDC20 protein rIV, Fig. 15

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:

Gly His Met Ala Ser Val Arg Thr Val Ser Gly His Gly Asn Ile Val
1 5 10 15

Val Ser Gly Ser Tyr Asp Asn Thr Leu Ile Val Trp Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 98:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CDC4 / CDC20 protein rV, Fig. 15

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

Gly His Thr Asp Arg Ile Tyr Ser Thr Ile Tyr Asp His Glu Arg Lys
1 5 10 15

Arg Cys Ile Ser Ala Ser Met Asp Thr Thr Ile Arg Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 99:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CDC4 / CDC20 protein rVI, Fig. 15

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:

Gly His Thr Ala Leu Val Gly Leu Leu Arg Leu Ser Asp Lys Phe Leu
1 5 10 15

Val Ser Ala Ala Ala Asp Gly Ser Ile Arg Gly Trp Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 100:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GBLP-CHLAMIDOMONAS HOMOLOG rI, Fig. 16

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:

Gly His Thr Asn Trp Val Thr Ala Ile Ala Thr Pro Leu Asp Pro Ser
1 5 10 15

Ser Asn Thr Leu Leu Ser Ala Ser Arg Asp Lys Ser Val Leu Val Trp
20 25 30

Glu



(2) INFORMATION FOR SEQ ID NO: 101:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GBLP-CHLAMIDOMONAS HOMOLOG rII, Fig. 16

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:

Gly His Ser His Phe Val Gln Asp Val Val Ile Ser Ser Asp Gly Gln
1 5 10 15

Phe Cys Leu Thr Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GBLP-CHLAMIDOMONAS HOMOLOG rIII, Fig. 16

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:

Gly His Thr Lys Asp Val Leu Ser Val Ala Phe Ser Val Asp Asn Arg
1 5 10 15

Gln Ile Val Ser Gly Ser Arg Asp Lys Thr Ile Lys Leu Trp Asn
20 25 30

(2) INFORMATION FOR SEQ ID NO: 103:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GBLP-CHLAMIDOMONAS HOMOLOG rIV, Fig. 16

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:

Gly His Thr Glu Trp Val Ser Cys Val Arg Phe Ser Pro Met Thr Thr
1 5 10 15

Asn Pro Ile Ile Val Ser Gly Gly Trp Asp Lys Met Val Lys Val Trp
20 25 30

Asn



(2) INFORMATION FOR SEQ ID NO: 104:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GBLP-CHLAMIDOMONAS HOMOLOG rV, Fig. 16

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104:

Gly His His Gly Tyr Val Asn Thr Val Thr Val Ser Pro Asp Gly Ser
1 5 10 15

Leu Cys Ala Ser Gly Gly Lys Asp Gly Ile Ala Met Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 105:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GBLP-CHLAMIDOMONAS HOMOLOG rVI, Fig.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:

Ile His Cys Leu Cys Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala
1 5 10 15

Thr Gln Ser Ser Ile Lys Ile Trp Asp Leu Glu Ser Lys Ser Ile Val
20 25 30



(2) INFORMATION FOR SEQ ID NO: 106:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GBLP-CHLAMIDOMONAS HOMOLOG rVII, Fig. 16

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

Lys Lys Ala Gln Val Pro Tyr Cys Val Ser Leu Ala Trp Ser Ala Asp
1 5 10 15

Gly Ser Thr Leu Tyr Ser Gly Tyr Thr Asp Gly Gln Ile Arg Val Trp
20 25 30

Ala



(2) INFORMATION FOR SEQ ID NO: 107:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: cop-1 protein rI, Fig. 17

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:

Met Ser Thr Arg Ser Lys Leu Ser Cys Leu Ser Trp Asn Lys His Glu
1 5 10 15

Lys Asn His Ile Ala Ser Ser Asp Tyr Glu Gly Ile Val Thr Val Trp
20 25 30

Asp

(2) INFORMATION FOR SEQ ID NO: 108:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: cop-1 protein rII, Fig. 17

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

Glu Lys Arg Ala Trp Ser Val Asp Phe Ser Arg Thr Glu Pro Ser Met
1 5 10 15

Leu Val Ser Gly Ser Asp Asp Cys Lys Val Lys Val Trp Cys
20 25 30

(2) INFORMATION FOR SEQ ID NO: 109:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: cop-1 protein rIII, Fig. 17

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:

Gly His Lys Lys Ala Val Ser Tyr Met Lys Phe Leu Ser Asn Asn Glu
1 5 10 15

Leu Ala Ser Ala Ser Thr Asp Ser Thr Leu Arg Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 110:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Coronin (p55) rI, Fig. 19

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:

Gly His Lys Ser Ala Val Leu Asp Ile Ala Phe His Pro Phe Asn Glu
1 5 10 15

Asn Leu Val Gly Ser Val Ser Glu Asp Cys Asn Ile Cys Ile Trp Gly
20 25 30



(2) INFORMATION FOR SEQ ID NO: 111:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Coronin (p55) rII, Fig. 19

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:

Gly His Lys Arg Lys Val Gly Thr Ile Ser Phe Gly Pro Val Ala Asp
1 5 10 15

Asn Val Ala Val Thr Ser Ser Gly Asp Phe Leu Val Lys Thr Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 112:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Coronin (p55) rIII, Fig. 19

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:

Gly His Ser Asp Met Ile Thr Ser Cys Glu Trp Asn His Asn Gly Ser
1 5 10 15

Gln Ile Val Thr Thr Cys Lys Asp Lys Lys Ala Arg Val Phe Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 113:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CORO PROTEIN rI, Fig. 18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:

Arg His Val Phe Ala Ala Gln Pro Lys Lys Glu Glu Cys Tyr Gln Asn
1 5 10 15

Leu Lys Thr Lys Ser Ala Val Trp Asp Ser Asn Tyr Val Ala Ala Asn
20 25 30

Thr Arg Tyr Ile Trp Asp
35

(2) INFORMATION FOR SEQ ID NO: 114:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CORO PROTEIN rII, Fig. 18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

Gly His Lys Ser Ala Val Leu Asp Ile Ala Phe His Pro Phe Asn Glu
1 5 10 15

Asn Leu Val Gly Ser Val Ser Glu Asp Cys Asn Ile Cys Ile Trp Gly
20 25 30



(2) INFORMATION FOR SEQ ID NO: 115:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CORO PROTEIN rIII, Fig. 18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115:

Gly His Lys Arg Lys Val Gly Thr Ile Ser Phe Gly Pro Val Ala Asp
1 5 10 15

Asn Val Ala Val Thr Ser Ser Gly Asp Phe Leu Val Lys Thr Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 116:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CORO PROTEIN rIV, Fig. 18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116:

Gly His Ser Asp Met Ile Thr Ser Cys Glu His Asn Gly Ser Gln Ile
1 5 10 15

Val Thr Thr Cys Lys Asp Lys Lys Ala Arg Val Phe Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 117:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CSTF 50kDa rI, Fig. 20

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:

Asp His Val Asp Glu Val Thr Cys Leu Ala Phe His Pro Thr Glu Gln
1 5 10 15

Ile Leu Ala Ser Gly Ser Arg Asp Tyr Thr Leu Lys Leu Phe Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 118:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CSTF 50kDa rII, Fig. 20

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:

Asp His Val Asp Glu Val Thr Cys Leu Ala Phe His Pro Thr Glu Gln
1 5 10 15

Ile Leu Ala Ser Gly Ser Arg Asp Tyr Thr Leu Lys Leu Phe Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 119:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CSTF 50kDa rIII, Fig. 20

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:

.Mi His Asp Giy Ala *'?*. s^-^-r M\ Jls r^ia ?.* r -.--t A-jr. Ser 



1 5 10 15

Lys Tyr Ile Leu Ser Ser Gly Lys Asp Ser Val Ala Lys Leu Trp Glu
20 25 30



(2) INFORMATION FOR SEQ ID NO: 120:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CSTF 50kDa rIV, Fig. 20

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:

Val His Arg Thr Gln Ala Val Phe Asn His Thr Glu Asp Tyr Val Leu
1 5 10 15

Leu Pro Asp Glu Arg Thr Ile Ser Leu Cys Cys Trp Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 121:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: CSTF 50kDa rV, Fig. 20

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:

Gly His Asn Asn Ile Val Arg Cys Ile Val His Ser Pro Thr Asn Pro
1 5 10 15

Gly Phe Met Thr Cys Ser Asp Asp Phe Arg Ala Arg Phe Trp Tyr
20 25 30

(2) INFORMATION FOR SEQ ID NO: 122:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA DROSOPH rI, Fig. 23

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:

Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Gly Asn Asp Ser Arg

Asn Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile Val Trp Asp
20 25 30
(2) INFORMATION FOR SEQ ID NO: 123:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA DROSOPH rII, Fig. 23

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

Gly His Glu Ser Asp Ile Asn Ala Val Thr Phe Phe Pro Asn Gly Gln

Ala Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: BETA DROSOPH rV, Fig. 23

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:

Ser His Asp Asn Ile Ile Cys Gly Ile Thr Ser Val Ala Phe Ser Lys

Ser Gly Arg Leu Leu Leu Ala Gly Tyr Asp Asp Phe Asn Cys Asn Val
20 25 30

Trp Asp



(2) INFORMATION FOR SEQ ID NO: 127:

(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA DROSOPH rVI, Fig. 23

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:

Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Glu Asn Gly Met
1 5 10 15

Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Arg Val Trp Asn
20 25 30

(2) INFORMATION FOR SEQ ID NO: 128:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN rI, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:

Gly His Asn Gly Trp Val Thr Gln Ile Ala Thr Thr Pro Gln Phe Pro
1 5 10 15

Asp Met Ile Leu Ser Ala Ser Arg Asp Lys Thr Ile Ile Met Trp Lys
20 25 30



(2) INFORMATION FOR SEQ ID NO: 129:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN rII, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:

Gly His Ser His Phe Val Ser Asp Val Val Ile Ser Ser Asp Gly Gln
1 5 10 15

Phe Ala Leu Ser Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN rIII, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:

Gly His Thr Lys Asp Val Leu Ser Val Ala Phe Ser Ser Asp Asn Arg

Gln Ile Val Ser Gly Ser Arg Asp Lys Thr Ile Lys Leu Trp Asn
20 25 30

(2) INFORMATION FOR SEQ ID NO: 131:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN rIV, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:

Ser His Ser Glu Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser Ser

Asn Pro Ile Ile Val Ser Cys Gly Trp Asp Lys Leu Val Lys Val Trp
20 25 30

Asn



(2) INFORMATION FOR SEQ ID NO: 132:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN rV, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:

Gly His Thr Gly Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser

Leu Cys Ala Ser Gly Gly Lys Asp Gly Gln Ala Met Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN rVI, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

Lys His Leu Tyr Thr Leu Asp Gly Gly Asp Ile Ile Asn Ala Leu Cys
1 5 10 15

Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala Thr Gly Pro Ser Ile
20 25 30

Lys Ile Trp Asp
35

(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN rVII, Fig. 24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:

Ala Glu Pro Pro Gln Cys Thr Ser Leu Ala Trp Ser Ala Asp Gly Gln
1 5 10 15

Thr Leu Phe Ala Gly Tyr Thr Asp Asn Leu Val Arg Val Trp Gln
20 25 30

(2) INFORMATION FOR SEQ ID NO: 135:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine rI, Fig. 21

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Gly Thr Asp Ser Arg
1 5 10 15

Leu Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 136:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine rII, Fig. 21

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:

Gly His Thr Gly Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gln
1 5 10 15

Ile Val Thr Ser Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 137:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine rIII, Fig. 21

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:

Gly His Thr Gly Asp Val Met Ser Leu Ser Leu Ala Pro Asp Thr Arg
1 5 10 15

Leu Phe Val Ser Gly Ala Cys Asp Ala Ser Ala Lys Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine rIV, Fig. 21

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:

Gly His Glu Ser Asp Ile Asn Ala Ile Cys Phe Phe Pro Asn Gly Asn
1 5 10 15

Ala Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine rV, Fig. 21

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:

Ser His Asp Asn Ile Ile Cys Gly Ile Thr Ser Val Ser Phe Ser Lys
1 5 10 15

Ser Gly Arg Leu Leu Leu Ala Gly Tyr Asp Asp Phe Asn Cys Asn Val
20 25 30

Trp Asp 



(2) INFORMATION FOR SEQ ID NO: 140:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine rVI, Fig. 21

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:

Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Asp Asp Gly Met
1 5 10 15

Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Lys Ile Trp Asn
20 25 30



(2) INFORMATION FOR SEQ ID NO: 141:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta-bovine (2) rI, Fig. 22

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:

Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Gly Thr Asp Ser Arg
1 5 10 15

Leu Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 142:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta-bovine (2) rII, Fig. 22

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142:

Gly His Thr Gly Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gln
1 5 10 15

Ile Ile Thr Ser Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 143:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta-bovine (2) rIII, Fig. 22

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:

Gly His Ser Gly Asp Val Met Ser Leu Ser Leu Ala Pro Asp Gly Arg
1 5 10 15

Thr Phe Val Ser Gly Ala Cys Asp Ala Ser Ile Lys Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 144:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta-bovine (2) rIV, Fig. 22

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:

Gly His Glu Ser Asp Ile Asn Ala Val Ala Phe Phe Pro Asn Gly Tyr
1 5 10 15

Ala Phe Thr Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 145:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta-bovine (2) rV, Fig. 22

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145:

Ser His Asp Asn Ile Ile Cys Gly Ile Thr Ser Val Ala Phe Ser Arg
1 5 10 15

Ser Gly Arg Leu Leu Leu Ala Gly Tyr Asp Asp Phe Asn Cys Asn Ile
20 25 30

Trp Asp



(2) INFORMATION FOR SEQ ID NO: 146:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta-bovine (2) rVI, Fig. 22

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:

Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Asp Asp Gly Met
1 5 10 15

Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Lys Ile Trp Asn
20 25 30

(2) INFORMATION FOR SEQ ID NO: 147:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta2 (Human) rI, Fig. 25

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147:

Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Gly Thr Asp Ser Arg
1 5 10 15

Leu Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 148:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta2 (Human) rII, Fig. 25

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:

Gly His Thr Gly Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gln
1 5 10 15

Ile Ile Thr Ser Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 149:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta2 (Human) rIII, Fig. 25

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:

Gly His Ser Gly Asp Val Met Ser Leu Ser Leu Ala Pro Asp Gly Arg
1 5 10 15

Thr Phe Val Ser Gly Ala Cys Asp Ala Ser Ile Lys Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 150:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta2 (Human) rIV, Fig. 25

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150:

Gly His Glu Ser Asp Ile Asn Ala Val Ala Phe Phe Pro Asn Gly Tyr
1 5 10 15

Ala Phe Thr Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 151:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta2 (Human) rV, Fig. 25

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151:

Ser His Asp Asn Ile Ile Cys Gly Ile Thr Ser Val Ala Phe Ser Arg
1 5 10 15

Ser Gly Arg Leu Leu Leu Ala Gly Tyr Asp Asp Phe Asn Cys Asn Ile
20 25 30

Trp Asp

(2) INFORMATION FOR SEQ ID NO: 152:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta2 (Human) rVI, Fig. 25

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152:

Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Asp Asp Gly Met
1 5 10 15

Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Lys Ile Trp Asn
20 25 30



(2) INFORMATION FOR SEQ ID NO: 153:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta4 (mouse) rI, Fig. 26

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153:

Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Gly Tyr Asp Ser Arg
1 5 10 15

Leu Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 154:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta4 (mouse) rII, Fig. 26

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154:

Gly His Thr Gly Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Gly Gln
1 5 10 15

Ile Ile Thr Ser Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 155:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta4 (mouse) rIII, Fig. 26

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155:

Gly His Ser Gly Asp Val Met Ser Leu Ser Leu Ser Pro Asp Leu Lys
1 5 10 15

Thr Phe Val Ser Gly Ala Cys Asp Ala Ser Ser Lys Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 156:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta4 (mouse) rIV, Fig. 26

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:

Gly His Ile Ser Asp Ile Asn Ala Val Ser Phe Phe Pro Ser Gly Tyr
1 5 10 15

Ala Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 157:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta4 (mouse) rV, Fig. 26

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157:

Ser His Asp Asn Ile Ile Cys Gly Ile Thr Ser Val Ala Phe Ser Lys
1 5 10 15

Ser Gly Arg Leu Leu Leu Ala Gly Tyr Asp Asp Phe Asn Cys Ser Val
20 25 30

Trp Asp



(2) INFORMATION FOR SEQ ID NO: 158:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: G-Beta4 (mouse) rVI, Fig. 26

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158:

Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Asp Asp Gly Met
1 5 10 15

Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Arg Ile Trp Asn
20 25 30

(2) INFORMATION FOR SEQ ID NO: 159:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GROUCHO PROT. DRSPH rI, Fig. 27

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159:

Thr Ser Ala Ala Pro Ala Cys Tyr Ala Leu Ala Ser Pro Asp Ser Lys
1 5 10 15

Val Cys Phe Ser Cys Cys Ser Asp Gly Asn Ile Ala Val Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 160:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GROUCHO PROT. DRSPH rII, Fig. 27

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160:

Gly His Thr Asp Gly Ala Ser Cys Ile Asp Ile Ser Pro Asp Gly Ser
1 5 10 15

Arg Leu Trp Thr Gly Gly Leu Asp Asn Thr Val Arg Ser Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 161:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GTP binding prt squid rI, Fig. 28

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161:

Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Ala Ser Asp S :
1 5 10 15

Asn Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile Val Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 162:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GTP binding prt squid rII, Fig. 28

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162:

Gly His Thr Gly Tyr Leu Ser Cys Cys Arg Phe Ile Asp Asp Asn Gln
1 5 10 15

Ile Val Thr Ser Ser Gly Asp Met Thr Cys Ala Leu Trp Asn
20 25 30

(2) INFORMATION FOR SEQ ID NO: 163:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GTP binding prt squid rIII, Fig. 28

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:

Gly His Thr Gly Asp Val Met Ser Leu Ser Leu Ala Pro Asp Met Arg
1 5 10 15

Thr Phe Val Ser Gly Ala Cys Asp Ala Ser Ala Lys Leu Phe Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 164:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GTP binding prt squid rIV, Fig. 28

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

Gly His Glu Ser Asp Ile Asn Ala Ile Thr Tyr Phe Pro Asn Gly Phe
1 5 10 15

Ala Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 165:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GTP binding prt squid rV, Fig. 28

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:

Ser His Asp Asn Ile Ile Cys Gly Thr Ser Val Ala Phe Ser Lys
1 5 10 15

Ser Gly Arg Leu Leu Leu Gly Gly Tyr Asp Asp Phe Asn Cys Asn Val
20 25 30

Trp Asp



(2) INFORMATION FOR SEQ ID NO: 166:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: GTP binding prt squid rVI, Fig. 28

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:

Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Glu Asp Gly Met
1 5 10 15

Ala Val Ala Thr Gly Ser Trp Asp
20

(2) INFORMATION FOR SEQ ID NO: 167:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: IEF SSP 9306 rI, Fig. 29

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:

Gly His Gln Lys Glu Gly Tyr Gly Leu Ser Trp Asn Pro Asn Leu Ser
1 5 10 15

Gly His Leu Leu Ser Ala Ser Asp Asp His Thr Ile Cys Leu Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 168:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: IEF SSP 9306 rII, Fig. 29

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

Gly His Thr Ala Val Val Glu Asp Val Ser Trp His Leu Leu His Glu
1 5 10 15

Ser Leu Phe Gly Ser Val Ala Asp Asp Gln Lys Leu Met Ile Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 169:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: IEF SSP 9306 rIII, Fig. 29

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:

Ser His Ser Val Asp Ala His Thr Ala Glu Val Asn Cys Leu Ser Phe
1 5 10 15

Asn Pro Tyr Ser Glu Phe Ile Leu Ala Thr Gly Ser Ala Asp Lys Thr
20 25 30

Val Ala Leu Trp Asp
35

(2) INFORMATION FOR SEQ ID NO: 170:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: IEF SSP 9306 rIV, Fig. 29

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:

Leu His Ser Phe Glu Ser His Lys Asp Glu Ile Phe Gln Val Gln
1 5 10 15

Ser Pro His Asn Glu Thr Ile Leu Ala Ser Ser Gly Thr Asp Arg Arg
20 25 30

Leu Asn Val Trp Asp
35

(2) INFORMATION FOR SEQ ID NO: 171:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: IEF SSP 9306 rV, Fig. 29

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171:

Ile Gly Glu Glu Gln Ser Pro Glu Asp Ala Glu Asp Gly Pro Pro Glu
1 5 10 15

Leu Leu Phe Ile His Gly Gly His Thr Ala Lys Ile Ser Asp Phe Ser
20 25 30

Trp Asn



(2) INFORMATION FOR SEQ ID NO: 172:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rI, Fig. 30

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:

Gly His Asn Gly Trp Val Thr Gln Ile Ala Thr Thr Pro Gln Phe Pro
1 5 10 15

Asp Met Ile Leu Ser Ala Ser Arg Asp Lys Thr Ile Ile Met Trp Lys
20 25 30



(2) INFORMATION FOR SEQ ID NO: 173:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rII, Fig. 30

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173:

Gly His Ser His Phe Val Ser Asp Val Val Ile Ser Ser Asp Gly Gln
1 5 10 15

Phe Ala Leu Ser Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 174:

(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rIII, Fig. 30

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:

Gly His Thr Lys Asp Val Leu Ser Val Ala Phe Ser Ser Asp Asn Arg
1 5 10 15

Gln Ile Val Ser Gly Ser Arg Asp Lys Thr Ile Lys Leu Trp Asn
20 25 30

(2) INFORMATION FOR SEQ ID NO: 175:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rIV, Fig. 30

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:

Ser His Ser Glu Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser Ser
1 5 10 15

Asn Pro Ile Ile Val Ser Cys Gly Trp Asp Lys Leu Val Lys Val Trp
20 25 30

Asn



(2) INFORMATION FOR SEQ ID NO: 176:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rV, Fig. 30

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:

Gly His Thr Gly Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser
1 5 10 15

Leu Cys Ala Ser Gly Gly Lys Asp Gly Gln Ala Met Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 177:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rVI, Fig. 30

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177:

Lys His Leu Tyr Thr Leu Asp Gly Gly Asp Ile Ile Asn Ala Leu Cys
1 5 10 15

Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala Thr Gly Pro Ser Ile
20 25 30

Lys Ile Trp Asp
35

(2) INFORMATION FOR SEQ ID NO: 178:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rVII, Fig. 30

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:

Val Ile Ser Thr Ser Ser Lys Ala Glu Pro Pro Gln Cys Thr Ser Leu
1 5 10 15

Ala Trp Ser Ala Asp Gly Gln Thr Leu Phe Ala Gly Tyr Thr Asp Asn
20 25 30

Leu Val Arg Val Trp Gln
35

(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: IEF-7442-human rI, Fig. 31

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:

Gly His Gln Lys Glu Gly Tyr Gly Leu Ser Trp Asn Ser Asn Leu Ser
1 5 10 15

Gly His Leu Leu Ser Ala Ser Asp Asp His Thr Val Cys Leu Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 180:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: IEF-7442-human rII, Fig. 31

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180:

Gly His Ser Ala Val Val Glu Asp Val Ala Trp His Leu Leu His Glu
1 5 10 15

Ser Leu Phe Gly Ser Val Ala Asp Asp Gln Lys Leu Met Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 181:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: IEF-7442-human rIII, Fig. 31

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:

Ala His Thr Ala Glu Val Asn Cys Leu Ser Phe Asn Pro Tyr Ser Glu
1 5 10 15

Phe Ile Leu Ala Thr Gly Ser Ala Asp Lys Thr Val Ala Leu Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 182:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: IEF-7442-human rIV, Fig. 31

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182:

Val His Trp Ser Pro His Asn Glu Thr Ile Leu Ala Ser Ser Gly Thr
1 5 10 15

Asp Arg Arg Leu Asn Val Trp Asp
20



(2) INFORMATION FOR SEQ ID NO: 183:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: IEF-7442-human rV, Fig. 31

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:

Gly His Thr Ala Lys Ile Ser Asp Phe Ser Trp Asn Pro Asn Glu Pro
1 5 10 15

Trp Val Ile Cys Ser Val Ser Glu Asp Asn Ile Met Gln Ile Trp Gln
20 25 30



(2) INFORMATION FOR SEQ ID NO: 184:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Insulin-like GF binding protein complex rI, Fig. 32

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:

Ala His Thr Pro Ala Leu Ala Ser Leu Gly Leu Ser Asn Asn Arg
1 5 10 15

Ser Arg Leu Glu Asp Gly Leu Phe Glu Gly Leu Gly Ser Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 185:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Insulin-like growth factor bind. pro. complex-rat rI, Fig. 33

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185:

Thr His Thr Pro Ser Leu Ala Ser Leu Ser Leu Ser Ser Asn Leu Leu
1 5 10 15

Gly Arg Leu Glu Glu Gly Leu Phe Gln Gly Leu Ser His Leu Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 186:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Insulin-like growth factor bind. pro. complex-rat rII, Fig. 33

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:

Asn His Leu Glu Thr Leu Ala Glu Gly Leu Phe Ser Ser Leu Gly Arg
1 5 10 15

Val Arg Tyr Leu Ser Leu Arg Asn Asn Ser Leu Gln Thr Phe Ser Pro
20 25 30

Gln Pro Gly Leu Glu Arg Leu Trp Leu Asp Ala Asn Pro Trp Asp
35 40 45

(2) INFORMATION FOR SEQ ID NO: 187:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: LIS1 (human) rI, Fig. 34

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:

Gly His Arg Ser Pro Val Thr Arg Val Ile Phe His Pro Val Phe Ser
1 5 10 15

Val Met Val Ser Ala Ser Glu Asp Ala Thr Ile Lys Val Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 188:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: LIS1 (human) rII, Fig. 34

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188:

Gly His Thr Asp Ser Val Gln Asp Ile Ser Phe Asp His Ser Gly Lys
1 5 10 15

Leu Leu Ala Ser Cys Ser Ala Asp Met Thr Ile Lys Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 189:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: LIS1 (human) rIII, Fig. 34

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189:

Gly His Asp His Asn Val Ser Ser Val Ala Ile Met Pro Asn Gly Asp
1 5 10 15

His Ile Val Ser Ala Ser Arg Asp Lys Thr Ile Lys Met Trp Glu
20 25 30



(2) INFORMATION FOR SEQ ID NO: 190:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: LIS1 (human) rIV, Fig. 34

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:

Gly His Arg Glu Trp Val Arg Met Val Arg Pro Asn Gln Asp Gly Thr
1 5 10 15

Leu Ile Ala Ser Cys Ser Asn Asp Gln Thr Val Arg Val Trp Val
20 25 30

(2) INFORMATION FOR SEQ ID NO: 191:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: LIS1 (human) rV, Fig. 34

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191:

Gly Ser Glu Thr Lys Lys Ser Gly Lys Pro Gly Pro Phe Leu Leu Ser
1 5 10 15

vJ'ly Ces Arg ;3p Li'j Vrr '.yz y.az Ti-J 
20 25

(2) INFORMATION FOR SEQ ID NO: 192:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: LIS1 (human) rVI, Fig. 34

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192:

Gly His Asp Asn Trp Val Arg Gly Val Leu Phe His Ser Gly Gly Lys
1 5 10 15

Phe Ile Leu Ser Cys Ala Asp Asp Lys Thr Leu Arg Val Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 193:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: LIS1 (human) rVII, Fig. 34

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193:

Ala His Glu His Phe Val Thr Ser Leu Asp Phe His Lys Thr :
1 5 10 15

Tyr Val Val Thr Gly Ser Val Asp Gln Thr Val Lys Val Trp Glu
20 25 30

(2) INFORMATION FOR SEQ ID NO; 194: 



25 30 
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(i) CKA1<ACT2RISTICS : 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unJaiown 

5 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTEffiTICAL: NO 

^0 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: MD6 rl, Pig. 35 

15 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:194: 

Gly His Ser Ala Arg Val Tyr Ala Leu Tyr Tyr Lys Asp Gly Leu Leu 

20 ' 15 

Cys Thr Gly Ser Asp Asp Leu Ser Ala Lys Leu Trp Asp 
2° 25 



25 



30 



40 



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: MD6 rII, Fig. 35

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195:

Thr His Thr Cys Ala Ala Val Lys Phe Asp Glu Gln Lys Leu Val Thr
1 5 10 15

Gly Ser Phe Asp Asn Thr Val Ala Cys Trp Glu
20 25

(2) INFORMATION FOR SEQ ID NO: 196:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: MD6 rIII, Fig. 35

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196:

Gly His Thr Gly Ala Val Phe Ser Val Asp Tyr Ser Asp Glu Leu Asp
1 5 10 15

Ile Leu Val Ser Gly Ser Ala Asp Phe Ala Val Lys Val Trp Ala
20 25 30

(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: MD6 rIV, Fig. 35

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:

Gly His Thr Glu Trp Val Thr Lys Val Val Leu Gln Lys Cys Lys Val
1 5 10 15

Lys Ser Leu Leu His Ser Pro Gly Asp Tyr Ile Leu Leu Ser Ala Asp
20 25 30

Lys Tyr Glu Ile Lys Ile Trp Pro
35 40



(2) INFORMATION FOR SEQ ID NO: 198:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: MSL1 rI, Fig. 36

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:

Lys His Asp Gly Gly Val Asn Ser Cys Arg Phe Asn Tyr Lys Asn Ser
1 5 10 15

Leu Ile Leu Ala Ser Ala Asp Ser Asn Gly Arg Leu Asn Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 199:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: MSL1 rII, Fig. 36

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:

Glu His Gly Thr Ser Val Ser Thr Leu Glu Trp Ser Pro Asn Phe Asp
1 5 10 15

Thr Val Leu Ala Thr Ala Gly Gln Glu Asp Gly Leu Val Lys Leu Trp
20 25 30

Asp

(2) INFORMATION FOR SEQ ID NO:200: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: MSL1 rIII, Fig. 36

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200:

Gly His Leu Gly Val Asn Asp Ile Ser Trp Asp Ala His Asp Pro

Trp Leu Met Cys Ser Val Ala Asn Asp Asn Ser Val His Ile Tl^. . :
20 25 30



(2) INFORMATION FOR SEQ ID NO: 201:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: MUS MUSCULUS PROTEIN rI, Fig. 37

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201:

Gly His Ser Gly Cys Val Asn Thr Val His Phe Asn Gln His Gly Thr
1 5 10 15

Leu Leu Ala Ser Gly Ser Asp Asp Leu Lys Val Ile Val Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 202:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: MUS MUSCULUS PROTEIN rII, Fig. 37

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:

Gly His Ile Phe Ile Trp Glu Lys Ser Ser Cys Gln Ile Val Gln Phe
1 5 10 15

Leu Glu Ala Asp Glu Gly Gly Thr Ile Asn Cys Ile Asp Ser His Pro
20 25 30

Tyr Leu Pro Val Leu Ala Ser Ser Gly Leu Asp His Glu Val Lys Ile
35 40 45

Trp Ser
50



(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: ORF RB1 rI, Fig. 38

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203:

Lys His Asp Gly Gly Val Asn Ser Cys Arg Phe Asn Tyr Lys Asn Ser
1 5 10 15

Leu Ile Leu Ala Ser Ala Asp Ser Asn Gly Arg Leu Asn Leu Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: ORF RB1 rII, Fig. 38

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204:

Glu His Gly Thr Ser Val Ser Thr Leu Glu Trp Ser Pro Asn Phe Asp
1 5 10 15

Thr Val Leu Ala Thr Ala Gly Gln Glu Asp Gly Leu Val Lys Leu Trp
20 25 30

Asp

(2) INFORMATION FOR SEQ ID NO: 205: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: ORF RB1 rIII, Fig. 38

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:

Gly His Met Leu Gly Val Asn Asp Ile Ser Trp Asp Ala His Asp Pro
1 5 10 15

Trp Leu Met Cys Ser Val Ala Asn Asp Asn Ser Val His Ile Trp Lys
20 25 30



(2) INFORMATION FOR SEQ ID NO: 206:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Periodic Trp prt rI, Fig. 39

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206:

Gly His Ile Thr Thr His His Thr Asp Ala Val Leu Ser Met Ala His
1 5 10 15

Asn Lys Tyr Phe Arg Ser Val Leu Ala Ser Thr Ser Ala Asp His Thr
20 25 30

Val Lys Leu Trp Asp
35

(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: Periodic Trp prt rII, Fig. 39

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:

Ile His Ser Asn Lys Asn Val Ser Ser Ser Glu Trp His Met Leu Asn
1 5 10 15

Gly Ser Ile Leu Leu Thr Gly Gly Tyr Asp Ser Arg Val Ala Leu Thr
20 25 30

Asp Val Arg Ile Ser Asp Glu Ser Gln Met Ser Lys Tyr Trp Ser
35 40 45

(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: PLAP rI, Fig. 40

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208:

Gly His Lys Asp Thr Val Cys Ser Leu Ser Ser Gly Lys Phe Gly Thr
1 5 10 15

Leu Leu Ser Gly Ser Trp Asp Thr Thr Ala Lys Val Trp Leu
20 25 30

(2) INFORMATION FOR SEQ ID NO: 209: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: PLAP rII, Fig. 40

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:

Gly His Thr Ala Ala Val Trp Ala Val Lys Ile Leu Pro Glu Gln Gly
1 5 10 15

Leu Met Leu Thr Gly Ser Ala Asp Lys Thr Ile Lys Leu Trp Lys
20 25 30



(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: PLAP rIII, Fig. 40

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210:

Gly His Glu Asp Cys Val Arg Gly Leu Ala Ile Leu Ser Glu Thr Glu
1 5 10 15

Phe Leu Ser Cys Ala Asn Asp Ala Ser Ile Arg Arg Trp Gln
20 25 30



(2) INFORMATION FOR SEQ ID NO: 211:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: PLAP rIV, Fig. 40

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:

Gly His Thr Asn Tyr Ile Tyr Ser Ile Ser Val Phe Pro Asn Ser Lys
1 5 10 15

Asp Phe Val Thr Thr Ala Glu Asp Arg Ser Leu Arg Ile Trp Lys
20 25 30

(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RETINOBLASTOMA BINDING PROTEIN - HUMAN rI, Fig. 41

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:

Gly His Gln Lys Glu Gly Tyr Gly Leu Ser Trp Asn Pro Asn Leu Ser
1 5 10 15

Gly His Leu Leu Ser Ala Ser Asp Asp His Thr Ile Cys Leu Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RETINOBLASTOMA BINDING PROTEIN - HUMAN rII, Fig. 41

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213:

Gly His Thr Ala Val Val Glu Asp Val Ser Trp His Leu Leu His Glu
1 5 10 15

Ser Leu Phe Gly Ser Val Ala Asp Asp Gln Lys Leu Met Ile Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 214:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RETINOBLASTOMA BINDING PROTEIN - HUMAN rIII, Fig. 41

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214:

Ser His Ser Val Asp Ala His Thr Ala Glu Val Asn Cys Leu Ser
1 5 10 15

Asn Pro Tyr Ser Glu Phe Ile Leu Ala Thr Gly Ser Ala Asp Lys Thr
20 25 30

Val Ala Leu Trp Asp
35

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RETINOBLASTOMA BINDING PROTEIN - HUMAN rIV, Fig. 41

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215:

Ser His Lys Asp Glu Ile Phe Gln Val Gln Trp Ser Pro His Asn Glu
1 5 10 15

Thr Ile Leu Ala Ser Ser Gly Thr Asp Arg Arg Leu Asn Val Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 216: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: RETINOBLASTOMA BINDING PROTEIN - HUMAN rV, Fig. 41

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216:

Gly His Thr Ala Lys Ile Ser Asp Phe Ser Trp Asn Pro Asn Glu Pro
1 5 10 15

Trp Val Ile Cys Ser Val Ser Glu Asp Asn Ile Met Gln Val Trp Gln
20 25 30



(2) INFORMATION FOR SEQ ID NO: 217:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: S253 PROTEIN rI, Fig. 42

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217:

Glu His Ala Leu Asp Ile Leu Asp Ala Asn Trp Ser Lys Asn Gly Phe
1 5 10 15

Leu Ile Thr Ala Ser Met Asp Lys Thr Ala Lys Leu Trp His
20 25 30

(2) INFORMATION FOR SEQ ID NO: 218:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: S253 PROTEIN rII, Fig. 42

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218:

Val His Pro Asp Phe Val Thr Ser Ala Ile Phe Phe Pro Asn Asp Asp
1 5 10 15

Arg Phe Ile Ile Thr Gly Cys Leu Asp His Arg Cys Arg Leu Trp Ser
20 25 30



(2) INFORMATION FOR SEQ ID NO: 219:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: SOF1 rI, Fig. 43

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

Gly His Arg Asp Gly Val Tyr Ala Ile Ala Lys Asn Tyr Gly Ser Leu
1 5 10 15

Asn Lys Leu Ala Thr Gly Ser Ala Asp Gly Val Ile Lys Tyr Trp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 220: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: SOF1 rII, Fig. 43

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:

Gly Leu Cys Val Thr Gln Pro Arg Phe His Asp Lys Lys Pro Asp Leu
1 5 10 15

Lys Ser Gln Asn Phe Met Leu Ser Cys Ser Asp Asp Lys Thr Val Lys
20 25 30

Leu Trp Ser
35

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: SOF1 rIII, Fig. 43

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221:

Gly Leu Ile Arg Thr Phe Asp Gly Glu Ser Ala Phe Gln Gly Ile Asp
1 5 10 15

Ser His Arg Glu Asn Ser Thr Phe Ala Thr Gly Gly Ala Lys Ile His
20 25 30

Leu Trp Asp
35

(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: SOF1 rIV, Fig. 43

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222:

Gly His Ser Arg Glu Ile Tyr His Thr Lys Arg Met Gln His Val Phe
1 5 10 15

Val Lys Tyr Ser Met Asp Ser Lys Tyr Ile Ile Ser Gly Ser Asp Asp
20 25 30

Gly Asn Val Arg Leu Trp Arg
35

(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: STE4-YEAST rI, Fig. 44

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223:

Gly His Asn Asn Lys Ile Ser Asp Phe Arg Trp Ser Arg Asp Ser Lys
1 5 10 15

Arg Ile Leu Ser Ala Ser Gln Asp Gly Phe Met Leu Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 224:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: STE4-YEAST rII, Fig. 44

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224:

Gly His Thr Cys Tyr Ile Ser Asp Ile Glu Phe Thr Asp Asn Ala His
1 5 10 15

Ile Leu Thr Ala Ser Gly Asp Met Thr Cys Ala Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 225:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: STE4-YEAST rIII, Fig. 44

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225:

Asp His Leu Gly Asp Val Leu Ala Leu Ala Ile Pro Glu Glu Pro Asn
1 5 10 15

Leu Glu Asn Ser Ser Asn Thr Phe Ala Ser Cys Gly Ser Asp Gly Tyr
20 25 30

Thr Tyr Ile Trp Asp
35

(2) INFORMATION FOR SEQ ID NO: 226: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: STE4-YEAST rIV, Fig. 44

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

Leu Asp Asn Gln Gly Val Val Ser Leu Asp Phe Ser Ala Ser Gly Arg
1 5 10 15

Leu Met Tyr Ser Cys Tyr Thr Asp Ile Gly Cys Val Val Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 227: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: STE4-YEAST rV, Fig. 44

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:

Gly His Gly Gly Arg Val Thr Gly Val Arg Ser Ser Pro Asp Gly Leu
1 5 10 15

Ala Val Cys Thr Gly Ser Trp Asp Ser Thr Met Lys Ile Trp Ser
20 25 30

(2) INFORMATION FOR SEQ ID NO:228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TRNSCRPTION FCTR TIIF rI, Fig. 45

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:

Gly His Thr Gly Pro Val Tyr Arg Cys Ala Phe Ala Pro Glu Met Asn
1 5 10 15

Leu Leu Leu Ser Cys Ser Glu Asp Ser Thr Ile Arg Leu Trp Ser
20 25 30

(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TRNSCRPTION FCTR TIIF rII, Fig. 45

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:

Gly His Val Tyr Pro Val Trp Asp Val Arg Phe Ala Pro His Gly Tyr
1 5 10 15

Tyr Phe Val Ser Cys Ser Tyr Asp Lys Thr Ala Arg Leu Trp Ala
20 25 30

(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TRNSCRPTION FCTR TIIF rIII, Fig. 45

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:

Gly His Leu Ser Asp Val Asp Cys Val Gln Phe His Pro Asn Ser A.::
1 5 10 15

Tyr Val Ala Thr Gly Ser Ser Asp Arg Thr Val Arg Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 231:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TRNSCRPTION FCTR TIIF rIV, Fig. 45

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231:

Gly His Lys Gly Ser Val Ser Ser Leu Ala Phe Ser Ala Cys Gly Arg
1 5 10 15

Tyr Leu Ala Ser Gly Ser Val Asp His Asn Ile Ile Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TRNSCRPTION FCTR TIIF rV, Fig. 45

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:

Arg His Thr Ser Thr Val Thr Thr Ile Thr Phe Ser Arg Asp Gly Thr
1 5 10 15

Val Leu Ala Ala Ala Gly Leu Arp Asn Asn Leu Thr Leu -^o Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 233:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 rI, Fig. 46

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233:

Ser Ser Asp Leu Tyr Ile Arg Ser Val Cys Phe Ser Pro Asp Gly Lys
1 5 10 15

Phe Leu Ala Thr Gly Ala Glu Asp Arg Leu Ile Arg Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 rII, Fig. 46

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234:

Gly His Glu Gln Asp Ile Tyr Ser Leu Asp Tyr Phe Pro Ser Gly Asp
1 5 10 15

Lys Leu Val Ser Gly Ser Gly Asp Arg Thr Val Arg Ile Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO:235: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 rIII, Fig. 46

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235:

Ile Glu Asp Gly Val Thr Thr Val Ala Val Ser Pro Gly Asp Gly Lys
1 5 10 15

Tyr Ile Ala Ala Gly Ser Leu Asp Arg Ala Val Arg Val Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 rIV, Fig. 46

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236:

Gly His Lys Asp Ser Val Tyr Ser Val Val Phe Thr Arg Asp Gly Gln
1 5 10 15

Ser Val Val Ser Gly Ser Leu Asp Arg Ser Val Lys Leu Trp Asn
20 25 30

(2) INFORMATION FOR SEQ ID NO:237: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 rV, Fig. 46

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237:

Gly His Lys Asp Phe Val Leu Ser Val Ala Thr Thr Gln Asn Asp Glu
1 5 10 15

Tyr Ile Leu Ser Gly Ser Lys Asp Arg Gly Val Leu Phe Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 238:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG rI, Fig. 47

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238:

Asp Phe Ser Asp Asp Cys Arg Ile Ala Ala Ala Gly Phe Gln Asp Ser
1 5 10 15

Tyr Ile Lys Ile Trp Ser
20

(2) INFORMATION FOR SEQ ID NO: 239:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG rII, Fig. 47

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239:

Gly His Ser Gly Thr Val Tyr Ser Thr Ser Phe Ser Pro Asp Asn Lys
1 5 10 15

Tyr Leu Leu Ser Gly Ser Glu Asp Lys Thr Val Arg Leu Trp Ser
20 25 30

(2) INFORMATION FOR SEQ ID NO: 240:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG rIII, Fig. 47

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240:

Gly His Asn His Pro Val Trp Asp Val Ser Phe Ser Pro Leu Gly His
1 5 10 15

Tyr Phe Ala Thr Ala Ser His Asp Gln Thr Ala Arg Leu Trp Ser
20 25 30



(2) INFORMATION FOR SEQ ID NO: 241:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG rIV, Fig. 47

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241:

'•iy His L-u ;..Ti k^-- "-*. V?.l :-?r rhi .--i i Pro ^--n C-lv Cv3
1 5 10 15

Tyr Val Phe Thr Gly Ser Ser Asp Lys Thr Cys Arg Met Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 242:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG rV, Fig. 47

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242:

Gly His Thr Ala Pro Val Ile Ser Ile Ala Val Cys Pro Asp Gly Arg
1 5 10 15

Trp Leu Ser Thr Gly Ser Glu Asp Gly Ile Ile Asn Val Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 243:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG rVI, Fig. 47

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243:

Gly His Gly Lys Asn Ala Ile Tyr Ser Leu Ser Tyr Ser Lys Glu Gly
1 5 10 15

Asn Val Leu Ile Ser Gly Gly Ala Asp His Thr Val Arg Val Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: YCU7 rI, Fig. 48

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:

Gly His Phe Asp Ser Thr Asn Ser Leu Ala Tyr Ser Pro Asp Gly Ser
1 5 10 15

Arg Val Val Thr Ala Ser Glu Asp Gly Lys Ile Lys Val Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: YCU7 rII, Fig. 48

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245:

Glu His Thr Ser Ser Val Thr Ala Val Gln Phe Ala Lys Arg Gly Gln
1 5 10 15

Val Met Phe Ser Ser Ser Leu Asp Gly Thr Val Arg Ala Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: YCU7 rIII, Fig. 48

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246:

Arg Ile Gln Phe Asn Cys Leu Ala Val Asp Pro Ser Gly Glu Val Val
1 5 10 15

Cys Ala Gly Ser Leu Asp Asn Phe Asp Ile His Val Trp Ser
20 25 30

(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: YCU7 rIV, Fig. 48

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247:

Gly His Glu Gly Pro Val Ser Cys Leu Ser Phe Ser Gln Glu Asn Ser
1 5 10 15

Val Leu Ala Ser Ala Ser Trp Asp Lys Thr Ile Arg Ile Trp Ser
20 25 30

(2) INFORMATION FOR SEQ ID NO: 248:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rI, Fig. 49

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:

Gly His Gly Ser Thr Ile Leu Cys Ser Ala Phe Ala Pro His Thr Ser
1 5 10 15

Ser Arg Met Val Thr Gly Ala Gly Asp Asn Thr Ala Arg Ile Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO: 249:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rII, Fig. 49

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249:

Gly His Tyr Asn Trp Val Leu Cys Val Ser Trp Ser Pro Asp Gly Glu
1 5 10 15

Val Ile Ala Thr Gly Ser Met Asp Asn Thr Ile Arg Leu Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 250:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rIII, Fig. 49

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250:

Gly His Ser Lys Trp Ile Thr Ser Leu Ser Trp Glu Pro Ile His Leu
1 5 10 15

Val Lys Pro Gly Ser Lys Pro Arg Leu Ala Ser Ser Ser Lys Asp Gly
20 25 30

Thr Ile Lys Ile Trp Asp
35

(2) INFORMATION FOR SEQ ID NO: 251:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rIV, Fig. 49

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:

Gly His Thr Asn Ser Val Ser Cys Val Lys Trp Gly Gly Gln Gly Leu
1 5 10 15

Leu Tyr Ser Gly Ser His Asp Arg Thr Val Arg Val Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO:252:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rV, Fig. 49

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:252:

Lys Ile Cys Lys Lys Asn Gly Asn Ser Glu Glu Met Met Val Thr Ala
1 5 10 15

Ser Asp Asp Tyr Thr Met Phe Leu Trp Asn
20 25

(2) INFORMATION FOR SEQ ID NO:253:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rVI, Fig. 49

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253:

Asn His Val Ala Phe Ser Pro Asp Gly Arg Tyr Ile Val Ser Ala Ser
1 5 10 15

Phe Asp Asn Ser Ile Lys Leu Trp Asp
20 25

(2) INFORMATION FOR SEQ ID NO:254:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rVII, Fig. 49

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:254:

Gly His Ile Ala Ser Val Tyr Gln Val Ala Trp Ser Ser Asp Cys Arg
1 5 10 15

Leu Leu Val Ser Cys Ser Lys Asp Thr Thr Leu Lys Val Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO:255:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rVIII, Fig. 49

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:255:

Ser Val Asp Leu Pro Gly Ile Lys Thr Lys Leu Tyr Val Asp Trp Ser
1 5 10 15

Val Asp Gly Lys Arg Val Cys Ser Gly Gly Lys Asp Lys Met Val Arg
20 25 30

Leu Trp Thr
35

(2) INFORMATION FOR SEQ ID NO:256:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: YKL525 rI, Fig. 50

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:256:

Leu His Leu Tyr Ala Pro Val Phe Tyr Ser Asp Val Phe Arg Val Phe
1 5 10 15

Met Glu His Ala Leu Asp Ile Leu Asp Ala Asn Trp Ser
20 25

(2) INFORMATION FOR SEQ ID NO:257:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: YKL525 rII, Fig. 50

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:257:

Val His Pro Asp Phe Val Thr Ser Ala Ile Phe Phe Pro Asn Asp Asp
1 5 10 15

Arg Phe Ile Ile Thr Gly Cys Leu Asp His Arg Cys Arg Leu Trp Ser
20 25 30



(2) INFORMATION FOR SEQ ID NO:258:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: yrb 1410 yeast rI, Fig. 51

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258:

Gly His Asn His Pro Val Trp Asp Val Ser Phe Ser Pro Leu Gly His
1 5 10 15

Tyr Phe Ala Thr Ala Ser His Asp Gln Thr Ala Arg Leu Trp Ser
20 25 30

(2) INFORMATION FOR SEQ ID NO:259:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: yrb 1410 yeast rII, Fig. 51

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:259:

Gly His Leu Asn Asp Val Asp Cys Val Ser Phe His Pro Asn Gly Cys
1 5 10 15

Tyr Val Phe Thr Gly Ser Ser Asp Lys Thr Cys Arg Met Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO:260:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: yrb 1410 yeast rIII, Fig. 51

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:260:

Gly His Thr Ala Pro Val Ile Ser Ile Ala Val Cys Pro Asp Gly Arg
1 5 10 15

Trp Leu Ser Thr Gly Ser Glu Asp Gly Ile Ile Asn Val Trp Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO:261:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: yrb 1410 yeast rIV, Fig. 51

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:261:

Gly His Gly Lys Asn Ala Ile Tyr Ser Leu Ser Tyr Ser Lys Glu Gly
1 5 10 15

Asn Val Leu Ile Ser Gly Gly Ala Asp His Thr Val Arg Val Trp Asp
20 25 30



(2) INFORMATION FOR SEQ ID NO:262:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: WD40 Consensus Sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262:

Gly His Ser Ala Ala Leu Ala Ala Leu Ala Leu Ser Pro Asp Ala Ala
1 5 10 15

Ala Ala Ala Leu Ala Ser Gly Ala Arg Asp Ala Thr Leu Arg Leu Trp
20 25 30

Asp Leu

(2) INFORMATION FOR SEQ ID NO:263:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: YES
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: WRTAA peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:263:

Trp Arg Thr Ala Ala
1 5



(2) INFORMATION FOR SEQ ID NO:264:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: YES
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: WRTAV peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:264:

Trp Arg Thr Ala Val
1 5

(2) INFORMATION FOR SEQ ID NO:265:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: YES
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(C) INDIVIDUAL ISOLATE: WRTA peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:265:

Trp Arg Thr Ala

1

1. A polypeptide composition effective to alter the
activity of a first protein, wherein the first protein interacts with a
second protein, and the second protein contains at least one WD-40
region,

said polypeptide having between 4 and 50 amino acids whose
sequence is the same as a sequence of the same length in the WD-40 region
of the second protein.
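
By way of illustration only, the length limitation recited in claim 1 can be sketched in code. The sketch below is not part of the original disclosure. It assumes that "between 4 and 50" is inclusive, and the function names `to_one_letter`, `candidate_fragments` and `is_within_claim_length` are hypothetical. It takes the repeat listed as SEQ ID NO:253 as the WD-40 region and enumerates every contiguous fragment that would satisfy the length and identity conditions.

```python
# Illustrative sketch only; names are hypothetical and "between 4 and 50"
# is assumed to be inclusive of both end points.

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_seq):
    """Convert a space-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[res] for res in three_letter_seq.split())


def candidate_fragments(region, min_len=4, max_len=50):
    """Yield each contiguous fragment of `region` with an allowed length."""
    longest = min(max_len, len(region))
    for length in range(min_len, longest + 1):
        for start in range(len(region) - length + 1):
            yield region[start:start + length]


def is_within_claim_length(peptide, region, min_len=4, max_len=50):
    """True if `peptide` has an allowed length and occurs verbatim in `region`."""
    return min_len <= len(peptide) <= max_len and peptide in region


# SEQ ID NO:253 (YCW2 protein, repeat rVI), as given in the sequence listing.
SEQ_ID_253 = (
    "Asn His Val Ala Phe Ser Pro Asp Gly Arg Tyr Ile Val Ser Ala Ser "
    "Phe Asp Asn Ser Ile Lys Leu Trp Asp"
)

region = to_one_letter(SEQ_ID_253)
fragments = list(candidate_fragments(region))
```

For a 25-residue region the enumeration covers fragment lengths 4 through 25 only, since no fragment can be longer than the region itself. Whether any given fragment actually alters the activity of the first protein is an experimental question that this sketch does not address.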



2. The composition of claim 1, wherein said polypeptide
inhibits interactions between the first protein and the second protein;
and/or wherein said polypeptide is an agonist of the activity of the
first protein; and/or wherein said polypeptide is an antagonist of the
activity of the first protein.

3. The composition of claim 1 or 2, wherein said WD-40
region has an amino acid sequence derived from the group consisting of
SEQ ID NO:76-261.

4. The composition of claim 3, wherein said WD-40 region 
has an amino acid sequence selected from the group consisting of SEQ ID 
NO:76-261. 

5. The polypeptide composition of claim 1 wherein said
polypeptide is coupled to a solid support.

6. A method to bind selectively said first protein which
method comprises contacting a sample putatively containing said first
protein with the polypeptide composition of claim 5; and

removing any unbound components of the sample from said
composition.

7. A method to assess the interaction of a first protein
with a polypeptide having a sequence the same as a sequence of the same
length contained in a WD-40 region of a second protein, which method
comprises

contacting a sample containing said first protein with
a polypeptide composition wherein the polypeptide has between 4 and 50
amino acids whose sequence is the same as the sequence of the same length
in the WD-40 region of the second protein, and observing any interaction
of the first protein with said polypeptide composition.



8. A method to assess the ability of a candidate compound
to bind a first protein which method comprises contacting said first
protein with a polypeptide composition which binds said first protein,
wherein the polypeptide of said composition has between 4 and 50 amino
acids whose sequence is the same as a sequence of the same length in a
WD-40 region of a second protein which interacts with said first protein,
in the presence and absence of said candidate compound; and

measuring the binding of said polypeptide in the presence
and in the absence of said candidate,

wherein decreased binding of the polypeptide in the presence
as opposed to the absence of said candidate indicates that said candidate
binds to said first protein.

9. A method to alter the activity of a first protein that 
interacts with a second protein, where the second protein contains at 
least one WD-40 region, said method comprising 

selecting a polypeptide having between 4 and 50 amino acids 
whose sequence is the same as a sequence of the same length in the WD-40 
region in the second protein, and 

contacting said polypeptide with said first protein under 
conditions which allow the formation of a complex between the polypeptide 
and the first protein, where said interaction is effective to alter the 
activity of the first protein. 

10. The method of claim 9, wherein said contacting is 
effective to inhibit the interaction between said first and second 
proteins; and/or wherein said contacting is effective to stimulate the 
activity of said first protein; and/or wherein said contacting is 
effective to inhibit the activity of said first protein. 

11. The method of any of claims 5-10, wherein said
polypeptide is derived from the group consisting of SEQ ID NO:76-261.

12. The method of claim 11, wherein said polypeptide is
selected from the group consisting of SEQ ID NO:76-261.

13. A composition of DNA molecules which consists of DNA
molecules having a nucleotide sequence encoding the polypeptide of any
of claims 1-4.

14. A DNA molecule which comprises an expression system for
the production of the polypeptide of any of claims 1-4 which expression
system comprises a nucleotide sequence encoding said polypeptide operably
linked to control sequences capable of effecting the expression of said
encoding nucleotide sequence.

15. Recombinant host cells modified to contain the
expression system of claim 14.

16. A method to produce a polypeptide having between 4 and
50 amino acids whose sequence is the same as the sequence of the same
length in a WD-40 region of a second protein which interacts with a first
protein, which method comprises culturing the cells of claim 15 under
conditions wherein said nucleotide sequence is expressed to produce said
polypeptide; and

optionally recovering said polypeptide from the culture.

17. A polypeptide composition effective to alter the
activity of a protein kinase C, where the protein kinase C interacts with
a second protein, and the second protein contains at least one WD-40
region,

said polypeptide having between 4 and 50 amino acids whose
sequence is the same as a sequence of the same length in the WD-40 region
of the second protein.

18. The composition of claim 17, wherein said second 
protein is a receptor for activated protein kinase C. 

19. The composition of claim 18, where said second protein
has the sequence represented by SEQ ID NO:27.

20. The composition of claim 17, wherein said polypeptide
is an agonist of the activity of protein kinase C; and/or wherein said
polypeptide is an antagonist of the activity of protein kinase C; and/or
wherein said polypeptide inhibits interactions between protein kinase C
and the second protein.

21. The composition of claim 20 wherein said polypeptide
has the sequence represented by SEQ ID NO:7, SEQ ID NO:4 or SEQ ID NO:2.

22. The composition of claim 17, wherein said WD-40 region
has an amino acid sequence derived from the group consisting of SEQ ID
NO:69-75.

23. The composition of claim 22, wherein said WD-40 region
has an amino acid sequence selected from the group consisting of SEQ ID
NO:69-75.

24. The polypeptide composition of claim 17 wherein said
polypeptide is coupled to a solid support.

25. A method to bind selectively protein kinase C which
method comprises contacting a sample putatively containing protein kinase
C with the polypeptide composition of claim 24; and

removing any unbound components of the sample from said
composition.

26. A method to assess the interaction of protein kinase
C with a polypeptide having a sequence the same as a sequence of the
same length contained in a WD-40 region of a second protein, which method
comprises

contacting a sample containing said protein kinase C
with a polypeptide composition wherein the polypeptide has between 4 and
50 amino acids whose sequence is the same as the sequence of the same
length in the WD-40 region of the second protein, and observing any
interaction of the protein kinase C with said polypeptide composition.

27. A method to assess the ability of a candidate compound
to bind protein kinase C which method comprises contacting said protein
kinase C with a polypeptide composition which binds said protein kinase
C, wherein the polypeptide of said composition has between 4 and 50 amino
acids whose sequence is the same as a sequence of the same length in a
WD-40 region of a second protein which interacts with said protein kinase
C, in the presence and absence of said candidate compound; and

measuring the binding of said polypeptide in the presence
and in the absence of said candidate,

wherein decreased binding of the polypeptide in the presence
as opposed to the absence of said candidate indicates that said candidate
binds to said protein kinase C.

28. A method to alter the activity of a protein kinase C
that interacts with a second protein, where said second protein contains
at least one WD-40 region, comprising

selecting a polypeptide having between 4 and 50 amino acids
whose sequence is the same as a sequence of the same length in the WD-40
region in the second protein, and

contacting said polypeptide with said protein kinase C under
conditions which allow the formation of a complex between the polypeptide
and the protein kinase C, where said interaction alters the activity of
said protein kinase C.

29. The method of claim 28, wherein said contacting is
effective to inhibit the interaction between said protein kinase C and
said second protein; and/or wherein said contacting is effective to
stimulate the activity of said protein kinase C; and/or wherein said
contacting is effective to inhibit the activity of said protein kinase
C.

30. The method of claim 28, wherein said polypeptide has
an amino acid sequence represented by SEQ ID NO:2, SEQ ID NO:4 or SEQ ID
NO:7.

31. The method of claim 28, wherein said polypeptide is
derived from the group consisting of SEQ ID NO:69-75.

32. The method of claim 31, wherein said polypeptide is
selected from the group consisting of SEQ ID NO:69-75.

33. A composition of DNA molecules which consists of DNA
molecules having a nucleotide sequence encoding the polypeptide of any
of claims 17-23.

34. A DNA molecule which comprises an expression system for
the production of the polypeptide of any of claims 17-23 which expression
system comprises a nucleotide sequence encoding said polypeptide operably
linked to control sequences capable of effecting the expression of said
encoding nucleotide sequence.

35. Recombinant host cells modified to contain the
expression system of claim 34.

36. A method to produce a polypeptide having between 4 and
50 amino acids whose sequence is the same as the sequence of the same
length in a WD-40 region of a second protein which interacts with protein
kinase C, which method comprises culturing the cells of claim 35 under
conditions wherein said nucleotide sequence is expressed to produce said
polypeptide; and

optionally recovering said polypeptide from the culture.

Fig. 11

Human 56 kDa protein (PWP homolog)

CDC4 / CDC20 protein

Coronin (p55)

GROUCHO PROTEIN DROSOPH



1 mypspvrhpa aggpppqgpi kftiadtler ikeefnflqa hyhsiklece 
51 klsnektemq rhyvmyyems yglnvemhkq teiakrlntl inqllpflqa 
101 dhqqqvlqav erakqvtirqa Inliigqqih cqqvpggppq pmgalrpfga 
151 Igatmglphg pqgllnkppe hhrpdikptg legpaaaeer Irnsvspadr 
201 ekyrtrspld iendskrrkd eklqedegek sdqdlwdva nemeshsprp 
251 ngehvsmevr dreslngert ekpsssgikq erppspsgss ssrstpslkt 
301 kdmekpgtpg akartptpna aapapgvnpk qiranpqgpppa gypgapyqrp 
351 adpyqrppsd paygrpppmp ydphahvrtn giphpsaltg gkpaysfhmn 
401 gegslqpvpf ppdalvgvgi prharqinti shgewcavt isnptkyvyt 
451 ggkgcvkvwdisqpgnknpv sqldclqrdn yirsvkllpdgrtlivgqea 
501 snlsiwdlas 



511 ptpri kaeltioa jacyal 
553 hneilvrqfqgtitdgascidispdgsjlLwt 



aspdsjc^cf s :csdoniQ Vwdl 

3gl|lntyrsis£diregrql 



691 nondfzsqif slgycntrlvAcv.penrjhv evlhcskpdk yqlhlhsscv 
701 kyivtgsgdk katvyeviy 



Fig. 27 



Fig. 28 (figure text not recoverable from the scan)



IEF SSP 9306



1 madkeaafdd aveervinee ykiwkkntpf lydlvmthal ewpsltaqwl 
51 pdvtrpegkd fsihrlvlgt htsdeqnhlv iasvqlpndd cqfdashyds 
101 ekgefggfgs vsgkieieik inhegevnra ryropqnpcii atktpssdvl 
151 vfdytkhpsk pdpsgecnpd 



307 klklhs 



171 lrlrfllL(|<eg yglswnpnlsg h|llsasddhtic lwdis nv 
pkegkwdak 

221 tiftfllitjaw edvswhllhe slffgsvaddqjajniMtrsn 
261ntskpstis y^dahtaevnclsf npysef i latqsofiktva lwdl rnl 
; Feshkdei f qvqwsphneti 



lassqtdr rlnvwdl s 



351 kigeeqspedaedgppeUfihgghtakisdf swopne 



Fig. 29 






HUMAN 12.3 



1 mteqmtlrgtlkglin<|wvtqiattpqfpdm iljsQsrdktii mwklt rdet 
51 nygipqralrglis.|ifvsdwissdgq fa] 
95 gtttrrfvfl|it.fc dvlsvafssdn rqivj; 

ssnpii 



LIVS 



pdgslcc sggkdgqamlwdln 



137 vcky tvqde shs i iwvscvrf spn 
181 ncklktnhighijyTntvtvs 
222 egkhUtldggdii nalcfspnrywl 

263 gkiivdelkqevist|sskaeppqctslawsadgqtlf[igytd 



saswdQtlrlwdltt 
sgsriktiklMltlg 
cgwdkl^kyjatola 



QQtaps ikiwdle 

ni vrvwavt igtr 



Fig. 30



IEF-7442 - human
1 maskemfedt veervineey kiwkkntpfl ydlvrathalq wpsltvqwlp 
51 evtkpegkdy alhwlvlgth tsdeqnhlw arvhipndda qfdashcdsd 
101 kgefggfgsv tgkieceiki nhegevnrar ympqnphiia tktpssdvlv 
151 fdytkhpakp dpsgecnpdl 



171 rlrflhqkfigyglswnsnlsghlli asddhtvclHdinagpkegkivdaka 
ZZliftflhsa^ 'vedvawhllheslfgi vaddqklmiadtrsnt 
261 ts kpshl vdaiita< vncl sf npysef i lat gsadktvalfidl ml kl klh 
311 tfeshkdeifqvhwsi hneti las sgtdrrlnvjKdlskigeeqsaedaed 

361 gppel I f i hgattal :i sdf swnpnepvwi 



vsednimqijKgmaeniynd 



411 eesdvttsel egqgs 






insulin-like growth factor binding protein complex 
1 malrkgglal allllsvwal gprslegadp gtpgeaegpa cpaacvcsyd 
51 ddadelsvfc ssrnltrlpd gvpggtqalw Idgnnlssvp paafqnlssl 
101 gflnlqggql gslepqallg lenlchlhle rnqlrslalg 



141 tfahtpalaslglsnnrlsrledglffiglgslodlnlgwn slavlpdaaf 



rglgslrelv 



201 lagnrlaylq palfsglael reldlsrnal raikanvfvq Iprlqklyld 
251 rnliaavapg aflglkalrw Idlshnrvag lledtfpgll glrvlrlshn 
301 aiaslrprtf kdlhfleelq Ighnrirqla ersfeglgql evltldhnql 
351 qevkagaflg Unvavmnls gnclrnlpeq vfrglgklhs Ihlegsclgr 
401 irphtf-tgls glrrlflkdn glvgieeqsl wglaelleld Itsnqlthlp 
451 hrlfqglgkl eylUsrnrl aelpadalgp Iqrafwldvs hnrlealpns 

'.lapV^riry Ulrnr.slrt ftpqppgler Iv/lcgncwdc ncplkalrdf 
551 alqnpsavpr fvqaicegdd cqppaytynn itcasppevv gldlrdls- : 
601 hfapc 

Fig. 32 


insulin like growth factor binding protein complex - rat 

1 malrtggpal wllafwval gpchlqgtdp gasadaegpq cpvactcshd 
51 dytdelsvfc ssknlthlpd dipvstralw Idgnnlssip saafqnlssl 
101 dflnlqgswl rslepqallg Iqnlyylhle rnrlrnlavg 



141 Ifthtpslaslslssnllgrleej Ifqglshlwdlnlgwn 



181 slwlpdtvf qglgnlhelv 

201 lagnkltylq palfcglgel reldlsrnal rsvkanvfvh Iprlqklyld 
251 rnlitavapg aflgmkalrw Idlshnrvag Imedtfpgll glhvlrlahn 
301 aiaslrprtf kdlhfleelq Ighnrirqlg ertfeglgql evltlndnqi 
351 tevrvgafsg Ifnvavinnls gnclrslper vfqgldklhs Ihlehsclgh 
401 vrlhtfagls glrrlflrdn sissieeqsl aglselleld Ittnrlthlp 
451 rqlfqglghl eylllsynql ttlsaevlgp Iqrafwldis 
491 hnjiletlaeglfsslgrvrylslrnnslqtfspqpglerlwldcnpwdcs 
541 cplkalrdfo Iqnpgvvprf vqtvcegddc qpvytynnit ccfjpanvsgl 

dlrdvsethf 
601 vhc 

Fig. 33 



LIS1 (human)

1 mvlsqrqrde Inraiadylr sngyeeaysv fkkeaeldvn eeldkkyagl 
51 lekkwtsvir Iqkkvmeles klneakeeft sggplgqkrd pkewiprppe 



101 kyalsfliirsp 

151 dfertlkfllitds 

191 tmhalidhn 
241 



/trvifhpvfsvmvsa; ledotik vwdy etg 
/qdisf dhsgkllasc; ladmtl klwdf ggfi 



ecir 

/ssvainipngdhivsasrdktik mwevq tgyrvkff 
tflh.rewl/pmvrpnqdgtUascj indqtvpyawatkecka 



291 elrehehweciswapessy 



311 s 



siseatfl5.etk <sgkpgp 



351 Iffltlvgiidnw 



fUsg 

/rgvlfhsggkfilsciddktl 



rdkt kmarivstgmc 
ryadyknk 



391 rcn'.ktlnc3!:£hf;,t5ldfhktapivytg!|vdqtvkv:'[£cr 



Fig. 34 






MD6 



1 merkdfetwl dnisvtflsl radlqknetld hlislsgavq Irhlsnnlet 
51 llkrdflkll plelsfyllk wldpqtlltc clvskqrnkv isactevwqt 
101 acknlgwqid dsvqdslhwk kvylkailrm kqledheafe 



141 
181 
220 
261 



pvyalyyk 
a avkfde 



tssllghs 
vygiqthtc 
qhfrghtt avf svdysdel 

Intltflhttwvtkvvlqkckvksllhspgdyil'Jsadky 



dg 1 1 c1 g sddlsaklwriv s tgq c 



qkl vi gsfdnts/aqsfiixssgart 



dilvi 



gsadfayjcyjatalsagtc 
ei kiwp igrei 



301 nckclktlsv sedrsiclqp rlhfdgkyiv cssalglyqw 
351dfasydilrv iktpevanla Ugfgdvfal Ifdnhylyim dlrteslisr 
401wplp8yrksk rgtsflager pg 



Fig. 35



MSI1

1 mnqcakdith eassipidlq eryshwkknt kUydylntn stkwpsltcq 
51 ffpdldttsd ehriUssft ssqkpedeti yiskistlgh ikwsslnnfd 
101 mdemefkpen strfpskhlv ndisiffpng ecnrarylpq npdiiagass 
151 dgaiyifdrt khgstPirqs kishpfetkl fgshgviqdv eamdtssadi 
201 neatslawnl qqeaUlssh .sngqvqvwdi kqyshenpii dlplvsinsd 
251 gtavndvtvwn pthdslfaac tegnavsUd Irtkkeklqs 



331 kspia 

381 ggjimlgv 



291 nrekhdggvnscrfn yknslila ►adsngrlnlBfdirnmn 

tmeligts\Jstlewspnf dtvlatagi jedgl vklfijitsceetifth 
ndisw dahdpwlmcsv indn svhiHkpagnlvg hs 



Fig. 36 



Fig. 37 (figure text not recoverable from the scan)



ORF RBI 



1 mnqcakdith eassipidlqeryshwkknt kllydylntn stkwpsltcq 
51 ffpdldttsd ehriUssft ssqkpedeti yiskistlghikwsslnnfd 
101 mdemefkpen strfpskhlv ndisiffpng ecnrarylpq npdiiagass 
151 dgaiyifdrt khgstrirqs kishpfetkl fgshgviqdv eamdtssadi 
201 neatslawnl qqealllssh sngqvqvwdi kqyshenpii dlplvsinsd 
251 gtavndvtvwn pthdslfaac tegnavsUd Iptkkeklqs 



381 



291 nrekhdc gvnscrf nykn sli 
331 kspiatmeligt svstlewspnfdtvlata^qedg 
gfltmlgvndiswdah dpwlmcsjandi 



IctsarfsngrlnlHdirnmn 

lyJslMtsceetifth 
n syhittHpagnlvghs 



Fig. 38



Periodic Trp protein 
1 misatnwvpr gfssefpeky vlddeeveri nqlaqlnldd akatleeaeg 
51 esgveddaat gssnklkdql didddlkeyn leeyddeeia dneggkdvsm 
101 fpglsndsdv kfhegekged pyislpnqed sqeekqelqv ypsdnlvloa 
151 rteddvsyld iyvyddgagf hssdipveeg deadpdvarg Ivrdpalyvh 
201 hdlmlpafpl cvewldykvg snseeaanya aigtfdpqie iwnldcvdka 
251 fpdmilgepl dnsmvslksk 



271 kkkkksktflh ittihtdavl smahnkyf rsvU stsadhtv kladlnsgn 

321 aarslasihs nkiivsssewrfinilngsilltggydsrvaltc vrisdesqmskyissamagee 



381 ietvtfasen iilcgtdsgn vysfdirnne nrkpvwtlka 
421 hdagi sties nkfipgmmst gamgektvkl 

451 wkfplddatn tkgpsmvlsr dfdvgnvlts sfapdievag tniviggvnkv 

531 IkV.vdvftnr svrksfksGl env':arak23 cqkickssri crkytsndnp 
551 dtvitiddqg edeeereggd ehddma 

Fig. 39 






PLAP 

1 mhymsghsnf vsyvciipss diyphgliat ggndhnicif sldspmplyi 



51 IkgJikdtvcslssgkf atl' SQSwdtta kvwl ndkcnmtl 
91 q3lit( lavwavkilpeqglm" tQsad kti kl wka grcertf 
131 Igliedcvrglails etef scand asi r r wq itgeclevy 
171 fohtnvivsisvfpnskdf\'ttQedr slriwkh gecaqti 



211 rlpaqsiwcc cvlengdiw gasdgiirvf teseertasa 

251 eeikaslsre spliakvltt eppiitpvrr tlpcrvtrsm issclsrlvs 

301 tslstsdshl titalhlflt tttte 

Fig. 40 






S253 PROTEIN 



1 mfksktstls ydetpnsneg drnatpvnpk eksqtkhlni pgdrsrhssi 
51 adskrsssry dggysadiip aqlrfidnid ygtrlrktlh rnswsngyn 
m klsendnvyf dlfdrkyfan yUeptyiki fkkksgleqf drnifl^qelk 
151 ipdvykstty qgepavanse Ifknsiccct fshdgkymvi gckdgslhlw 
201 kvinspvkrs emgrseksvs asranslkiq rhlasisshn gsissndlkp 
251 sdqfegpskq Ihlyapvfys 



271 dvfrvfmelia 
311 kyslktfvj 



dildanw skngflito ;mdkta klwhp er 
llpjfvtSQiffpnddrfiitQildhrc rlws i 



351 Idnevsyafd 
4C1 fhvsdkstqg 
451 tndskiqifd 
501 wfytwkmqsf 
551 tetsnqsssh 
601 apdvciknis 
631 cfssnlsnw 
731 fhlsoc^-kin 
ol ceihpnnsp/ 
801 phdiprvstt 
851 fr 



ckdlitsltl 
ttknsfhpss 
Inekkplelf 
nlsaemncta 
tftnssknvl 
IsndUfelt 
nnv^tilitt 
nhrndsilen 

ypklkcdvcn 



sppggeytii 
eygkvqhgpr 
kgfqsgssrh 
phrkkrlsgs 
qtqtvgsqai 
sqyfke;7:gqn 
d^cglirvfr 
rmijerssted 

gsnfecaskn 



gtfngyiyvl 
itglqcffsk 
rgqflmmkne 
mslkgllriv 
knnhyisfha 
ysesketcdn 
tdilpeirkk 
n^fsttppsn 

. , I :<!.•• 

piaggdsgft 



Ithglkfvss 

vdknlrlivt 

pwftgsddh 

snkstndecl 

hnspvtcasi 

kpnhpvtetg 

iiakfheynl 

th'isrpshdf 

cadcgtii. •. 



Fig. 42



STE4 - YEAST 

1 raaahqmdsit ysnnvtqqyi qpqslqdisa vedeiqnkie aarqeskqlh 
51 aqinkakhki qdaslfqman kvtsltknki nlkpnivl 



89 



kfllinn cisdf rwsrdsk 



pilsc sqdgfmliMsasglkqnai 



131 



pidsqwvlscaispsstlvasaglnnnctiypvskenrva 



171 
211 

261 
301 

241 
331 



qnvasif kflttc asdieft 



dnahil- lasodintca lwdip 



kakrvreysdhlgi ivlalaipeepnlenssntfa: cgsdgytyiMsrsp 



sa vqsfyvndsdi nal rf f kdgmsi vagsd ngainmydl r 
sdcsiatfslfrgyeertptptymaanmey ntaqspqtlk 



s tss sy IjJnq j-vvs IdfsQsqrl.-nvscvtdiocvvvvdvlk 
! I 



qeivokle png g 



-vt!5vrs5tJdqlavcLijawdstmK i»sp gyq 



Fig. 44 



Fig. 45 (figure text not recoverable from the scan)



TUP1



1 mtasvsntqn klnelldair qeflqvsqea ntyrlqnqkd ydfkmnqqla 
51 emqqirntvy elelthrkmk dayeaeikhl klgleqrdhq iasltvqqqq 
101 qqqqqqqvqq hlqqqqqqla aazasvpyaq qppattsata tpcartttgs 
151 psafpvqasr pnlvgsqlpt ttlpwssna qqqlpqqqlq qqqlqqqqpp 
201 pqvsvaplsn taingsptsk etttlpsvka pestlketep ennntskind 
251 tgsattattt tateteikpk eedatpaslh qdhylvpynq ranhskpipp 
301 flldldsqsv pdalkkqtnd yyilynpalp reidvelhks Idhtswccv 
351 kfsndgeyla tgcnkttqvy rvsdgslvar Isddsaannh rnsitenntt 
401 tstdnntmtt tttttittta mtsaaelakd venlntsssp 



ssdlyjL rsvcf spdgkflojtgaeirl 
I qflJieqcfiysl dyf psgdkl 



441 
481 

521 Itlsiedgv 
571 sgtflhkdsf/ysvvftrdgqsvN 
621 tcevtyiflhkdfVlsvattqndeyiljsgskd 



ioadienrkivmi 
sqsQdrtvpiwdT rfgqr< 

itvavspgdgkyicfagsldra^cyffidsetgflverldsene 



sgslrirsyJiljwnlqnannksdsktpnsg 
rovl fwd kk 



€61 .'n.cnpHiTilqc hrrsvisvcv cncsslgpsy nvfctgsgdc 
/01 /<uriv,k>Aki a^n 



Fig. 46 






TUP1 HOMOLOG



1 msqkqstnqn qngthqpqpv knqrtnnaag ansgqqpqqq sqgqsqqqgr 
51 sngpfsasdl nrivleyink kgyhrtecanl raesgrtltp qnkqspantk 
101 tgkfpeqssi ppnpgktakp isnptnlssk rdaeggivss grleglnape 
151 nyiraysralk nwvdssleiy kpelsyimyp ifiylflnlv aknpvyarrf 
201 fdrfspdfkd fhgseinrif svnsidhike nevasafqsh kyritmsktt 
251 Inlllyflne nesiggslii svinqhldpn ivesvtarek ladgikvlsd 
301 sengngkqnl emnsvpvklg pfpkdeefvk eietelkikd dqekqlnqqt 
351 agdnysgann rtUqeykom nnekfkdntg dddkdkikdk iakdeekkes 
401 elkvdgekkd snlsspardi Iplppktald Ikleiqkvke srdaikldnl 
451 qlalpsvcmy 



461 tf qntnkdmscldfsc icriaaag 
511 dkdedptcktlvgJis^tvystsf spdnky 
561 



vsykglinh pvwdvs fsplghyfatashdqt arlwscdhiy 



601 
641 
686 



plrifagtilr 
gdsvrlfl ght i 
Ikqrargligl* 



dvdcvs fhpngcy\ 
fjpvisi avcpdgrwl 



f qdsyikijfltsldgssl nnpnial nnn 
Isgsedkt jcclntsmdthtal 



ftgssdkt cmsd^st 
stgsedgi in vwdig tgkr 



naiyslsyskegnvT| isggodht ^Eyjwjilkkattep 



731 saepdepfig ylgdvtasin qdikeygrrr tviptsdlva 
771 sfytkktpvf kvkfsrsnla laggafrp 

Fig. 47 






YCU7 

1 mvrrfrgkel aattfnghrd yvmgaffshd qekiytvskd gavfvweftk 
51 rpsddddnes edddkqeevd iskyswritk khffycnqak vkcvtfhpat 
101 rUavgftsg efrlydlpdf tliqqlsmgq npvntvsvnq tgewlafgss 
151 klgqllvyew 



161 qsesyilkqqghfds 
202 itsgfclatfeehtssl/ta 
251 ryrnfrtftgteriqfn 
291 



bnslay spdgsrwbaseiigkikvijrd 
vqfakrgqvraf ;ssl|lgtvraHdli 



ilavdpsgewcags 



gqlldalsghegp /scl sfsqensvia jaswdktiriHsi 



dnfdih VHsvqt 



111 ff^I'^YfP^ evysdvlals mrpdgkevav stlkgqisif niedakqvgn 
391 idcrkdiisg rfnqdrftakilndpnfllq yitvlmwvll wlwiitpfv 
431 yinnfqrriksc 



Fig. 48






YCW2 PROTEIN 



1 mstlipppsk kqkkeaqlpr evaiipkdlp nvsikfqald tgdnvggalr
51 vpgaisekql eellnqlngt sddpvpytfs ctiqgkkflsd pvktiditdn 
101 lysslikpgy nstedqitll ytpravfkvk 



221 
271 



qytmsflJitn 



131 pvtrsssaiaflligstjilcsafaph tssrmv 
181 tlkflbynwf/lcvswsp dgevia 



Igdalrfliiskw itslswepihlvkpgskprla 



t 



scvkwggqg 



Uy 



:gagdntariHdcdtqtpmh 
rgsmjtintirljiKdpksgqc 
sskdgtikiffdtvsrvc 
gshdptvrvjsfdi nsqg 



311 rcinilksha hwvnhlslst dyalpigafd htgkkpstpe 



351 eaqkkal enyeki ckk ngnse 
4«1 hqklvniivafspdgr 
441 dgkfistfrfliiiadvyqvawssdc 



emmvt isddytmflaoplkstkpiarmtg 
yivs Jsfdnsi klwdg r 
rUvscskdtilkvwdv 



431 rtrklsvdlp^iktklyvdv/ 3vdgkrvcsljgk£i:<«nvpl:2th 



Fig. 49




Fig. 50 

YKL525 

1 mfksktstls ydetpnsneg drnatpvnpk eksqtkhlni pgdrsrhssi 
51 adskrsssry dggysadiip aqirfidnid ygtrlrktlh rnswsngyn 